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Abstract 

We prove that in various natural models of a random quotient 
of a group, depending on a density parameter, for each hyperbolic 
group there is some critical density under which a random quotient 
is still hyperbolic with high probability, whereas above this critical 
value a random quotient is very probably trivial. We give explicit 
characterizations of these critical densities for the various models. 



Introduction 

What does a generic group look like? 

The study of random groups emerged from an affirmation of M. Gromov 
that "almost every group is hyperbolic" (see |Grolj ). More precisely, fix m 
and and consider the group G presented by ( ai, . . . , Om | • • • , tat ) where 
the rj's are words of length £i in the letters af^. Then the ratio of the number 
of A^-tuples of words rj such that G is hyperbolic, to the total number of A^- 
tuples of words rj, tends to 1 as inf £j — > cx). The first proof of this theorem 
was given by A.Y. Ol'shanskii in jOlslj . and independently by C. Champetier 
in |Chl] . thus confirming Gromov's statement. 

Later, M. Gromov introduced (cf. |Gro2| ) a finer model of random group, 
in which the number of relators is allowed to be much bigger. 

This model goes as follows: Choose at random N cyclically reduced words 
of length £ in the letters af^, uniformly among the set of all such cyclically 
reduced words (recall a word is called reduced if it does not contain a sequence 
of the form Oja"^ or a~^ai and cyclically reduced if moreover the last letter 
is not the inverse of the first one). Let R be the (random) set of these 
words, the random group is defined by the presentation ( ai, . . . , | R). 

Let us explain how A^ is taken in this model. There are (2m)(2m — 1)^~^ ~ 
(2m — 1)^ reduced words of length i. We thus take A^ = (2m — 1)*^ for some 
number d between and 1 called density. 
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The theorem stated by Gromov in this context expresses a sharp phase 
transition between hyperbolicity and triviality, depending on the asymptotics 
of the number of relators taken, which is controlled by the density parame- 
ter d. 

Theorem 1 (M. Gromov, [Gro2| ) - Fix a density d between and 
1. Choose a length £ and pick at random a set R of (2m — lY^ uniformly 
chosen cyclically reduced words of length i in the letters af^, . . . , a^. 

If d < 1/2 then the probability that the presentation {ai, . . . ,am \ R) 
defines an infinite hyperbolic group tends to 1 as i ^ oo. 

If d > 1/2 then the probability that the presentation (ai, . . . ,0^ | R) 
defines the group {e} or Z/2Z tends to 1 as £ ^ 00. 

There was a small mistake in the original proof of Gromov: the proof uses 
van Kampen diagrams, and the case when some relator appears several times 
in a given van Kampen diagram was forgotten (this mistake was apparently 
first detected by R. Kenyon); when no relator appears twice there is much 
more independence in the probabilities and the proof is easier. A complete 
proof of this theorem is included below (section . 

Let us discuss the intuition behind this model. What does the density pa- 
rameter d mean? Following the excellent exposition of Gromov in |Gro2| . we 
assimilate d£ to a dimension. That is, d£ represents the number of "equations" 
we can impose on a random word so that we still have a reasonable chance 
to find such a word in a set of (2m — randomly chosen words (compare 
to the basic intersection theory for random sets stated in section 1^?^ . 

For example, for large £, in a set of 2'^^ randomly chosen words of length 
£ in the two letters "a" and "b", there will probably be some word beginning 
with d£ letters "a". (This is a simple exercise.) 

As another example, in a set of (2m — lY^ randomly chosen words on 
af^, there will probably be two words having the same first 2d£ letters, but 
no more. In particular, if < 1/12 then the set of words will satisfy the 
small cancellation property C"(l/6) (see |GHj for definitions). But as soon 
as (i > 1/12, we are far from small cancellation, and as d approaches 1/2 we 
have arbitrarily big cancellation. 

The purpose of this work is to give similar theorems in a more general 
situation. The theorem above states that a random quotient of the free 
group Fm is hyperbolic. A natural question is: does a random quotient of a 
hyperbolic group stay hyperbolic? 

This would allow in particular to iterate the operation of taking a ran- 
dom quotient. This kind of construction is at the heart of the "wild" group 
constructed in |Gro4] . 

Our theorems precisely state that for each hyperbolic group (with "harm- 
less" torsion), there is a critical density d under which the quotient stays 
hyperbolic, and above which it is probably trivial. Moreover, this critical 
density can be characterized in terms of well-known numerical quantities 
depending on the group. 
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We need a technical assumption of "harmless" torsion (see Definition fTK|l . 
Hyperbolic groups with harmless torsion include torsion-free groups, free 
products of torsion-free groups and/or finite groups (such as PSL2(Zi)), etc. 
This assumption is necessary: Appendix ICl proves that Theorem HI does not 
hold for some hyperbolic groups with harmful torsion ^. 

There are several ways to generalize Gromov's theorem: a good replace- 
ment in a hyperbolic group for reduced words of length £ in a free group 
could, equally likely, either be reduced words of length i again, or elements 
of norm i in the group (the norm of an element is the minimal length of 
a word equal to it). We have a theorem for each of these two cases. We 
also have a theorem for random quotients by uniformly chosen plain words 
(without any assumption). 

In the first two versions, in order to have the number of reduced, or 
geodesic, words of length i tend to infinity with we have to suppose that 
G is not elementary. There is no problem with the case of a quotient of an 
elementary group by plain random words (and the critical density is in this 
case). 

Let us begin with the case of reduced words, or cyclically reduced words 
(the theorem is identical for these two variants). 

We recall the definition and basic properties of the cogrowth of a group 
G in section 11.21 below. Basically, if G is not free, the number of reduced 
words of length i which are equal to e in G behaves like (2m — 1)^^. For a 
free group, r] is (conventionally, by the way) equal to 1/2. It is always at 
least 1/2. 

Theorem 2 (Random quotient by reduced words) Let G be 

a non-elementary hyperbolic group with harmless torsion, generated by the 
elements ai, . . . ,am- Fix a density d between and 1. Choose a length i and 
pick at random a set R of (2m — l)'^^ uniformly chosen (cyclically) reduced 
words of length i in af^. Let {R) be the normal subgroup generated by R. 
Let Tj be the cogrowth of the group G. 

If d < I — r], then, with probability tending to 1 as i oo, the quotient 
G/ {R) is non-elementary hyperbolic. 

If d > 1 — 77, then, with probability tending to 1 as £ — oo, the quotient 
G/{R) is either {e} or Z/2Z. 

We go on with the case of elements on the ^-sphere of the group. 

In this case, for the triviality part of the theorem, some small-scale phe- 
nomena occur, comparable to the occurrence of Z/2Z above (think of a ran- 
dom quotient of Z by any number of elements of norm i). In order to avoid 

^ These results were announced in |0111| without this assumption. I would like to thank 
Prof. A.Yu. Ol'shanskii for having pointed an error in the first version of this manuscript 
regarding the treatment of torsion, which led to this assumption and to Appendix [O 
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them, we take words of norm not exactly but of norm between ^ — L and 
£ + L for some fixed L > (L = 1 is enough) . 

Theorem 3 (Random quotient by elements of a sphere) 

Let G he a non-elementary hyperbolic group with harmless torsion, generated 
by the elements ai, . . . , Om. Fix a density d between and 1. Choose a length 
i. 

Let be the set of elements of G which are of norm between i — L and 
i + L with respect to af^, . . . , (for some fixed L > 0). Let N be the 
number of elements of S^. 

Pick at random a set R of N'^ uniformly chosen elements of S^. Let {R) 
be the normal subgroup generated by R. 

If d < 1/2, then, with probability tending to 1 as £ — > oo, the quotient 
G/ (R) is non-elementary hyperbolic. 

If d > 1/2, then, with probability tending to 1 as £ ^ oo, the quotient 
G/{R) is {e}. 

The two theorems above were two possible generalizations of Gromov's 
theorem. One can wonder what happens if we completely relax the assump- 
tions on the words, and take in our set R any kind of words of length i with 
respect to the generating set. The same kind of theorem still applies, with 
of course a smaller critical density. 

The gross cogrowth 6' of a group is defined in section fOl below. Basically, 
1 — 6* is the exponent (in base 2m) of return to e of the random walk on the 
group. We always have 9 > 1/2. 

Now there are (2m)^ candidate words of length i, so we define density 
with respect to this number. 

Theorem 4 (Random quotient by plain words) Let G be a 

hyperbolic group with harmless torsion, generated by the elements 
Fix a density d between and 1. Choose a length i and pick at random a 
set R of {2mY^ uniformly chosen words of length i in af^. Let {R) be the 
normal subgroup generated by R. 

Let 9 be the gross cogrowth of the group G. 

If d < 1 — 9, then, with probability tending to 1 as £ oo, the quotient 
G/ {R) is non-elementary hyperbolic. 

If d > 1 — 9, then, with probability tending to 1 as i oo, the quotient 
G/{R) is either {e} or Z/2Z. 

Precisions on the models. Several points in the theorems above are left 
for interpretation. 

There is a slight difference between choosing times a random word 
and having a random set of words, since some word could be chosen 
several times. But for d < 1/2 the probability that a word is chosen twice 
is very small and the difference is negligible; anyway this does not affect our 
statements at all, so both interpretations are valid. 
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Numbers such as {2m)'^ are not necessarily integers. We can either take 
the integer part, or choose two constants Ci and C2 and consider taking the 
number of words between Ci{2mY^ and C2(2m)'^^. Once more this does not 
affect our statements at all. 

The case c? = is peculiar since nothing tends to infinity. Say that a 
random set of density is a random set with a number of elements growing 
subexponentially in I (e.g. with a constant number of elements). 

The possible occurrence of Z/2Z above the critical density only reflects 
the fact that it may be the case that a presentation of G has no relators of 
odd length (as in the free group). So, when quotienting by words of even 
length, at least Z/2Z remains. 

Discussion of the models. Of course, the three theorems given above are 
not proven separately, but are particular cases of a more general (and more 
technical!) theorem. This theorem is stated in section 

Our general theorem deals with random quotients by words picked under 
a given probability measure. This measure does not need to be uniform, 
neither does it necessarily charge words of only one given length. It has to 
satisfy some natural (once the right terminology is given...) axioms. The 
axioms are stated in section H73t and the quite sophisticated terminology for 
them is given in section W?2\. 

For example, using these axioms it is easy to check that Theorem El still 
holds when quotienting by words taken in the ball rather than in the sphere, 
or that taking a random quotient by reduced words or by cyclically reduced 
words is (asymptotically) the same, with the same critical density. 

It is also possible to take quotients by words of different lengths, but 
our method imposes that the ratio of the lengths be bounded. This is a 
restriction due to the geometric nature of some parts of the argument, which 
rely on the hyperbolic local-global principle, using metric properties of the 
Cayley complex of the group (cf. appendix Ej) • 

In the case of various lengths, density has to be defined as the supremum 
of the densities at each length. 

The very first model of random group given in this article (the one used 
by Ol'shanskii and Champetier), with a constant number of words of pre- 
scribed lengths, is not the case ci = of our model, since in this model the 
ratio of lengths can be unbounded, which completely prevents the use of 
some geometric methods. However, this model can probably be obtained by 
iterating the process of taking a random quotient at (i = 0, or by using the 
relative small cancellation techniques later developed by Delzant in (D] and 
by Gromov in |(Tro4| . 

But another model encountered in the literature, which consists in uni- 
formly picking a fixed number of words of length between 1 and £, satisfies 
easily our axioms, as it is almost exactly our case c? = 0. Indeed there are 
so much more words of length close to I than close to 0, that the elements 
taken under this model are of length comprised between (1 — e)l and ^ for 
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any e. 

Whereas random plain words or random reduced words can be easily con- 
structed independently of the group, it could seem difficult, at first glance, 
to take a quotient by random elements of a sphere. Let us simply recall 
(cf. |GH] ) that in a hyperbolic group, it is possible to define for each ele- 
ment a normal geodesic form, and that there exists a finite automaton which 
recognizes exactly the words which are normal forms of elements of the group. 

Note that all our models of random quotients depend on a generating 
subset. For example, adding "false generators" (i.e. generators equal to e) to 
our generating sets makes the cogrowth and gross cogrowth arbitrarily close 
to 1, thus the critical density for reduced words and plain words arbitrarily 
small. The case of random quotients by elements of the ^-sphere seems to be 
more robust. 

In [Z], A. Zuk proves that a random quotient of the free group by reduced 
words at density greater than 1/3 has property T. As a random quotient of 
any group is the quotient of a random quotient of the free group by the 
relations defining the initial group, this means that the random quotients we 
consider possess property T as well for reduced words and densities above 
1/3. 

Other developments on generic properties of groups. Other prop- 
erties of generic groups have been studied under one or another model of 
random group. Besides hyperbolicity, this includes topics such as small can- 
cellation properties, torsion elements, topology of the boundary, property T, 
the fact that most subgroups are free, planarity of the Cayley graph, or the 
isomorphism problem; and more are to come. See for example jChlj . |AOj . 

0, 0, EDI, IKSl. 

Random groups have been used by M. Gromov to construct a "wild" group 
related to C*-algebraic conjectures, see |Gro4| . 

The use of generic properties of groups also led to an announcement of 
an enumeration of one-relator groups up to isomorphism, see jKSSj . 

In a slightly different approach, the study of what a generic group looks 
like has very interesting recent developments: genericity can also be under- 
stood as a topological (rather than probabilistic) property in the space of all 
finite type groups. See for example the work of C. Champetier in |(]h3| . 

In all these works, properties linked to hyperbolicity are ubiquitous. 

Acknowledgements. I would like to thank, in alphabetical order, Thomas 
Delzant, Etienne Ghys, Misha Gromov, Claire Kenyon, Richard Kenyon, 
Pierre Pansu, Panos Papasoglu, Frederic Paulin and Andrzej Zuk for in- 
structive talks and comments. 

Special thanks to Prof. A.Yu. Ol'shanksii, who kindly pointed out an er- 
ror in the treatment of torsion in a previous version of the manuscript (the 
assumption of harmless torsion did not appear), which led to the counterex- 
amples of Appendix ini as well as for careful reading and suggestions for the 



6 



text. 

Part of the ideas of this work emerged during my stay at the Ecole normale 
superieure of Lyon in April 2002, at the invitation of Andrzej Zuk. I would 
like to thank all the team of the Mathematics Department there for their 
great warmth at receiving me. 

1 Definitions and notations 

1.1 Basics 

Throughout all this text, G will be a discrete hyperbolic group given by a 
presentation ( ai, . . . , Om | R) where S = {ai, . . . , am, o-i^, ■ ■ ■ a^} is a sym- 
metric set of 2m generators, and R is a finite set of words on S. (Every 
discrete hyperbolic group is finitely presented, cf. (S).) 

We shall denote by 5 a hyperbolicity constant for G w.r.t. S. Let A be 
the maximal length of relations in R. 

A hyperbolic group is called non-elementary if it is neither finite nor 
quasi-isometric to Z. 

A word will be a word made of letters in S. Equality of words will always 
mean equality as elements of the group G. 

A word is said to be reduced if it does not contain a generator a E S 
immediately followed by its inverse a~^. It is said to be cyclically reduced if 
it and all of its cyclic permutations are reduced. 

If is a word, we shall call its number of letters its length and denote 
it by \w\. Its norm, denoted by \\w\\, will be the smallest length of a word 
equal to w in the group G. 

1.2 Growth, cogrowth, and gross cogrowth 

First, we recall the definition of the growth, cogrowth and gross cogrowth of 
the group G with respect to the generating set S. 

Let be the set of all words of length i in af^. Let Sq be the set of all 
elements of G the norm of which is equal to i with respect to the generating 
set af^. The growth g controls the asymptotics of the number of elements of 
Sq-. this number is roughly equal to (2m)^^. The gross cogrowth 9 controls 
the asymptotics of the number of words in which are equal to the neutral 
element in G: this number is roughly equal to (2m)^^. The cogrowth rj is the 
same with reduced words only: this number is roughly (2m — 1)"^^. 

These quantities have been extensively studied. Growth now belongs 
to the folklore of discrete group theory (see e.g. [(jdlH] or |(tK| for back- 
ground and open problems). Cogrowth has been introduced by R. Grig- 
orchuk in |Gri| . and independently by J. Cohen in 121 ■ For some examples 
see |Ch2| or |W1] . Gross cogrowth is linked (see below) to the spectrum of 
the random walk on the group, which, since the seminal work by H. Kesten 
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(see |K1| and |K2| ). has been extensively studied (see for example the nu- 
merous technical results in |W2j and the references therein). 

Definition 5 (Growth, cogrowth, gross cogrowth) 

The growth of the group G with respect to the generating set ai, . . . , 
is defined as ^ 

g = lim - log2^ 

The gross cogrowth of the group G with respect to the generating set 
Oi, . . . , am is defined as 

^ = 7 ^°S2m, e S'^,w = e in G} 

t^oo I 
I even 

The cogrowth of the group G with respect to the generating set ai, . . . ,am 
is defined as 77 = 1/2 for a free group, and otherwise 

7] = lim - log2m_i E S^,w = e in G, w reduced} 

I even 



Let us state some properties of these quantities. All of them are proven 
in |K2|, (nni or p. 

The limits are well-defined by a simple subadditivity (for growth) or su- 
peradditivity (for the cogrowths) argument. We restrict ourselves to even i 
because there may be no word of odd length equal to the trivial element, as 
is the case e.g. in a free group. 

For cogrowth, the logarithm is taken in base 2m — 1 because the number 
of reduced words of length i behaves like (2m — 1)^. 

Cogrowth and gross cogrowth lie between 1/2 and 1. Gross cogrowth is 
strictly above 1/2, as well as cogrowth except for the free group. There exist 
groups with cogrowth or gross cogrowth arbitrarily close to 1/2. 

The probability that a random walk in the group G (with respect to the 
same set of generators) starting at e, comes back to e at time t is equal to 
the number of words equal to e in G, divided by the total number of words of 
length i. This leads to the following characterization of gross cogrowth, which 
states that the return probability at time t is roughly equal to (2m)~*^^~^)*. 
This will be ubiquitous in our text. 

Alternate definition of gross cogrowth Let Pt be the prob- 
ability that a random walk on the group G (with respect to the generating 
set ai, . . . , ttm) starting at e at time 0, comes back to e at time t. 
Then the gross cogrowth of G w.r.t. this generating set is equal to 

9=1+ lim I log2m Pt 

t even 
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In particular, (2m)^~^ is the spectral radius of the random walk operator 
(denoted A in |Klj and r in |Gri| V which is the form under which it is studied 
in these papers. 

A cogrowth, or gross cogrowth, of 1 is equivalent to amenability. 
It is easy to check that gl2 + 6 ^ 1. 

Gross cogrowth and cogrowth are linked by the following equation (see (HHI): 
{2mY = (2m - l)*^* + (2m - 1)^^^ 

The gross cogrowth of the free group Fm is | logj^ (8m — 4), and this is 
the only group on m generators with this gross cogrowth (see |Klj ). This 
tends to 1/2 as m ^ oo. 

There are various conventions for the cogrowth of the free group. The 
definition above would give — oo. In [HI the cogrowth of the free group is 
taken equal to 0; in jOrij it is not defined. Our convention allows the formula 
above between cogrowth and gross cogrowth to be valid even for the free 
group; it is also natural given the fact that, for any group except the free 
group, the cogrowth is strictly above 1/2. Moreover, this leads to a single 
formulation for our random quotient theorem, as with this convention, the 
critical density for quotients by reduced words will be equal to 1 — 77 in any 
case. So we strongly plead for this being the right convention. 

If G is presented as F^/N where is a normal subgroup, cogrowth is 
the growth (in base 2m — 1) of A^. Gross cogrowth is the same considering 
A^ as a submonoid in the free monoid on 2m generators and in base 2m. 

Let A be the Laplacian on G (w.r.t. the same generating set). As the 
operator of convolution by a random walk is equal to 1 — A, we get another 
characterization of gross cogrowth. The eigenvalues lie in the interval [0;2]. 
Let Ao be the smallest one and Aq the largest one. Then the gross cogrowth 
of G w.r.t. this generating set is equal to 

9=1 + \0g2m sup(l - Ao, X'o - 1) 

(We have to consider Ag due to parity problems.) 

Cogrowth and gross cogrowth depend on the generating set. For example, 
adding trivial generators a, = e makes them arbitrarily close to 1. 

1.3 Diagrams 

A filamenteous van Kampen diagram in the group G with respect to the 
presentation {S \ R) will be a planar connected combinatorial 2-complex 
decorated in the following way: 

• Each 2-cell c bears some relator r E R. The number of edges of the 
boundary of c is equal to |r|. 

• If e is an (unoriented) edge, denote by e+ and e_ its two orientations. 
Then e+ and e_ both bear some generator a G S", and these two gen- 
erators are inverse. 
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• Each 2-cell c has a marked vertex on its boundary, and an orientation 
at this vertex. 

• The word read by going through the (oriented) edges of the boundary 
of cell c, starting at the marked point and in the direction given by the 
orientation, is the relator r E R attached to c. 

Note on the definition of regular complexes: we do not require that each closed 2-cell 
be homeomorphic to the standard disc. We only require the interior of the 2-cell to be 
homeomorphic to a disc, that is, the application may be non-injective on the boundary. 
This makes a difference only when the relators are not reduced words. For example, if 
abb~^c is a relator, then the two diagrams below are valid. We will talk about regular 
diagrams to exclude the latter. 




We will use the terms 2-cell and face interchangeably. 

A non-filamenteous van Kampen diagram will be a diagram in which 
every 1- or 0-cell lies in the boundary of some 2-cell. Unless otherwise stated, 
in our text a van Kampen diagram will implicitly be non-filamenteous. 

A n-hole van Kampen diagram will be one for which the underlying 2- 
complex has n holes. When the number of holes is not given, a van Kampen 
diagram will be supposed to be simply connected (0-hole). 

A decorated abstract van Kampen diagram (davKd for short) is defined 
almost the same way as a van Kampen diagram, except that no relators are 
attached to the 2-cells and no generators attached to the edges, but instead, 
to each 2-cell is attached an integer between 1 and the number of 2-cells of 
the diagram (and yet, a starting point and orientation to each 2-cell). 

Please note that this definition is a little bit emended in section lOl f more decoration 
is added). 

A davKd is said to be fulfillable w.r.t. presentation {S \ R) if there exists 
an assignment of relators to 2-cells and of generators to 1-cells, such that 
any two 2-cells bearing the same number get the same relator, and such that 
the resulting decorated diagram is a van Kampen diagram with respect to 
presentation {S \ R). 

A davKd with border Wi, . . . , Wn, where Wi, . . . ,Wn are words, will be a 
(n — l)-hole davKd with each boundary edge decorated by a letter such that 
the words read on the n components of the boundary are wi, . . . ,Wn- A 
davKd with border is said to be fulfillable if, as a davKd, it is fulfillable 
while keeping the same boundary words. 

A word w is equal to the neutral element e in G if and only if some 
no-hole, maybe filamenteous, davKd with border w is fulfillable (see |LS|1. 

A van Kampen diagram is said to be reduced if there is no pair of adjacent 
(by an edge) 2-cells bearing the same relator with opposite orientations and 
with the common edge representing the same letter in the relator (w.r.t. the 
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starting point). A davKd is said to be reduced if there is no pair of adjacent 
(by an edge) 2-cells bearing the same number, with opposite orientations and 
a common edge representing the same letter in the relator. 

A van Kampen diagram is said to be minimal if it has the minimal number 
of 2-cells among those van Kampen diagrams having the same boundary word 
(or boundary words if it is not simply connected). A fulfillable davKd with 
border is said to be minimal in the same circumstances. 

Note that a minimal van Kampen diagram is necessarily reduced: if there 
were a pair of adjacent faces with the same relator in opposite orientations, 
then they could be removed to obtain a new diagram with less faces and the 
same boundary (maybe adding some filaments): 




Throughout the text, we shall use the term diagram as a short-hand 
for "van Kampen diagram or fulfillable decorated abstract van Kampen dia- 
gram". We will use the term minimal diagram as a short-hand for "minimal 
van Kampen diagram or minimal fulfillable decorated abstract van Kampen 
diagram with border". 

1.4 Isoperimetry and narrowness 

There is a canonical metric on the 1-skeleton of a van Kampen diagram (or 
a davKd), which assigns length 1 to every edge. If -D is a diagram, we will 
denote its number of faces by |D| and the length of its boundary by \dD\. 

It is well-known (see (H)) that a discrete group is hyperbolic if and only if 
there exists a constant C > such that any minimal diagram D satisfies the 
linear isoperimetric inequality \dD\ ^ C\D\. We show in Appendix IbI that 
in a hyperbolic group, holed diagrams satisfy an isoperimetric inequality as 
well. 

Throughout all the text, C will be an isoperimetric constant for G. 

The set of 2-cells of a diagram is also canonically equipped with a metric: 
two 2-cells sharing a common edge are defined to be at distance 1. The 
distance to the boundary of a face will be its distance to the exterior of the 
diagram considered as a face, i.e. a boundary face is at distance 1 from the 
boundary. 

A diagram is said to be A-narrow if any 2-cell is at distance at most A 
from the boundary. 

It is well-known, and we show in Appendix IbI in the form we need, that 
a linear isoperimetry implies narrowness of minimal diagrams. 
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2 The standard case: F, 



We proceed here to the proof of Gromov's now classical theorem (Theorem[TI) 
that a random quotient of the free group Fm is trivial in density greater than 
1/2, and non-elementary hyperbolic in density smaller than this value. 

We include this proof here because, first, it can serve as a useful template 
for understanding the general case, and, second, it seems that no completely 
correct proof has been published so far^. 

Recall that in this case, we consider a random quotient of the free group 
Fm on m generators by (2m — lY^ uniformly chosen cyclically reduced words 
of length 

A random cyclically reduced word is chosen in the following way: first 
choose the first letter (2m possibilities), then choose the next letter in such a 
way that it is not equal to the inverse of the preceding one (2m — 1 possibili- 
ties), up to the last letter which has to be distinct both from the penultimate 
letter and the first one (which lets 2m — 2 or 2m — 1 choices depending on 
whether the penultimate letter is the same as the first one). The difference 
between 2m and 2m — 1 at the first position, and between 2m — 1 and 2m — 2 
at the last position is negligible (as £ — *■ oo) and we will do as if we had 
2m — 1 choices for each letter exactly. 

So, for the sake of simplicity of the exposition, in the following we may 
assume that there are exactly (2m — 1)^ reduced words of length with 
2m — 1 choices for each letter. Bringing the argument to full correctness is a 
straightforward exercise. 

2.1 Triviality for d > 1/2 

The triviality of the quotient for d > 1/2 reduces essentially to the well- 
known 

Probabilistic pigeon-hole principle Let £ > and put A^^/^+e 
pigeons uniformly at random among N pigeon-holes. Then there are two 
pigeons in the same hole with probability tending to 1 as N ^ oo (and this 
happens arbitrarily many times with growing N). 

Now, take as your pigeon-hole the word made of the first £ — 1 letters of a 
random word of length £. There are (2m — 1)^~^ pigeon-holes and we pick up 
(2m — 1)"^ random words with d > 1/2. Thus, with probability arbitrarily 
close to 1 with growing we will pick two words of the form wtti, waj where 
\w\ = £ — 1 and a^, aj G S. Hence in the quotient group we will have = aj. 

But as d is strictly bigger than 1/2, this will not occur only once but 
arbitrarily many times as £ — > oo, with at each time and being chosen 
at random from S. That is, for big enough £, all couples of generators a,b E S 

^ Since the proof included here was written and diffused, a similar but somewhat simpler 
proof has been published in [Zj for a slightly different model in which relators are of length 
3 but the number of generators m tends to infinity. 
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will satisfy a = 6 in the quotient group. As S is symmetric, in particular 
they will satisfy a = a~^. 

The group presented by ( (cj) | aj = a^^.ai = aj ) is Z/2Z. In case 
£ is even this is exactly the group we get (as there are only relations of even 
length), and if i is odd any relation of odd length turns Z/2Z into {e}. 

This proves the second part of Theorem [H 

2.2 Hyperbolicity for d<l/2 

We proceed as follows: We will show that the (reduced) davKd's which are 
fulfillable by a random presentation necessarily satisfy some linear isoperi- 
metric inequality. This is stronger than proving that only minimal diagrams 
satisfy an isoperimetric inequality: in fact, all reduced diagrams in a random 
group satisfy this inequality. (Of course this cannot be true of non-reduced 
diagrams since one can, for example, take any relator r and arrange an arbi- 
trarily large diagram of alternating r's and r~^'s like in a chessboard.) 

Thus we will evaluate the probability that a given decorated abstract van 
Kampen diagram can be fulfilled by a random presentation. We show that if 
the davKd violates the isoperimetric inequality, then this probability is very 
small and in fact decreases exponentially with i. 

Then, we apply the Cartan-Hadamard-Gromov theorem for hyperbolic 
spaces, which tells us that to ensure hyperbolicity of a group, it is not nec- 
essary to check the isoperimetric inequality for all diagrams but for a finite 
number of them (see section El for details). 

Say is it enough to check all diagrams with at most K faces, where K 
is some constant depending on d but not on i. Assume we know that for 
each of these diagrams which violates the isoperimetric inequality, the prob- 
ability that it is fulfillable decreases exponentially with £. Let D{K) be the 
(finite) number of davKd's with at most K faces, violating the isoperimet- 
ric inequality. The probability that at least one of them is fulfillable is less 
that D{K) times some quantity decreasing exponentially with £, and taking 
£ large enough ensures that with probability arbitrarily close to one, none 
of these davKd's is fulfillable. The conclusion then follows by the Cartan- 
Hadamard-Gromov theorem. 

The intuitive basic picture is as follows: Consider a davKd made of two 
faces of perimeter £ meeting along L edges. The probability that two given 
random relators r, r' fulfill this diagram is at most (2m — 1)~^, which is the 
probability that L given letters of r are the inverses of L given letters of r' . 
(Remember that as the relators are taken reduced, there are only 2m — 1 
choices for each letter except for the first one. As 2m — 1 < 2m we can safely 
treat the first letter like the others, as doing otherwise would still sharpen 
our evaluation.) 
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Now, there are {2m — relators in the presentation. As we said, the 
probability that two given relators fulfill the diagram is at most (2m — 
Thus, the probability that there exist two relators in the presentation ful- 
filling the diagram is at most (2m — l)^'^(2m — 1)"^, with the new factor 
accounting for the choice of the two relators. 

This evaluation becomes non-trivial for L > 2di. Observe that the bound- 
ary length of the diagram is 2i - 2L = 2(1 - 2d)e - 2{L - 2di). That is, if 
L ^ 2di then the boundary is longer than 2(1 — 2d)i, and if L > 2di then 
the probability that the diagram can be fulfilled is exponentially small with 
i. 

To go on with our intuitive reasoning, consider a graph with n relators 
instead of two. The number of "conditions" imposed by the graph is equal to 
the total length L of its internal edges, that is, the probability that a random 
assignment of relators satisfy them is (2m — 1)~^, whereas the number of 
choices for the relators is (2m — l)"'^^ by definition. So if L > ndi the 
probability is too small. But if L ^ ndi, then the boundary length, which 
is equal to ni — 2L, is bigger than (1 — 2d)n£ which is the isoperimetric 
inequality we were looking for. 

This is the picture we will elaborate on. In fact, what was false in the 
last paragraph is that if the same relator is to appear several times in the 
diagram, then we cannot simply multiply probabilities as we did, as these 
probabilities are no more independent. 

Thus, let D he a reduced davKd. We will evaluate the probability that 
it can be fulfilled by relators of a random presentation. Namely 

Proposition 6 - Let D be a reduced davKd. The probability that D 
can be fulfilled by relators of a random presentation is at most (2m — 

lY\dD\-l\D\il-2d))/2\D\_ 

Proof - Each face of D b number between 1 and \D\. Let n be the 

number of distinct numbers the faces bear in D. Of course, n ^ l^l. (The 
original proof by Gromov was valid only when n = \D\, so that all relators 
are chosen independently, which simplifies the proof. If n < \D\ then we 
cannot simply multiply probabilities as in the basic picture.) Suppose, for 
simplicity, that these n distinct numbers are 1, 2, . . . , n. 

To fulfill D is to give n relators ri, . . . , r„ satisfying the relations imposed 
by the diagram. 

We will construct an auxiliary graph T summarizing all letter relations im- 
posed by the diagram D. Vertices of T will represent the letters of ri, . . . , r„. 
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and edges of T will represent inverseness (or equality, depending on orienta- 
tion) of letters imposed by shared edges between faces of D. 

Thus, take ni vertices for F, arranged in n parts of i vertices. Call the 
vertices corresponding to the faces of D bearing number i the i-th part of 
the graph. Each part is made of £ vertices. 

We now explain what to take as edges of T. 

In the diagram, every face is marked with a point on its boundary, and 
an orientation. Label the edges of each face 1,2, ... ,i starting at the marked 
point, following the given orientation. 

If, in the davKd D, the k-th edge of a face bearing number i is equal to 
the k'-th edge of an adjacent face bearing number j, then put an edge in T 
between the k-th vertex of the i-th part and the k'-th vertex of the j-th part. 
Decorate the newly added edge with —1 if the two faces' orientations agree, 
or with +1 if they disagree. 

Thus, a —1 edge between the k-th vertex of the i-th part and the k'-th 
vertex of the j-th part means that the k-th letter of relator rj has to be the 
inverse of the k'-th letter of relator rj. 

Successively add an edge to T in this way for each interior edge of the 
davKd D, so that the total number of edges of T is equal to the number of 
interior edges of D. 

As D is reduced, the graph F can contain no loop. It may well have 
multiple edges, if, in the davKd, several pairs of adjacent faces bear the same 
numbers and have common edges at the same position. 

Note that this graph only depends on the davKd D and in no way on the 
random presentation. 

The graph F for the basic picture above is: 



r r' 




Now let us evaluate the probability that D is fulfillable. To fulfill D is to 
assign a generator to each vertex of F and see if the relations imposed by the 
edges are satisfied. 

Remark that if the generator of any vertex of the graph is assigned, then 
this fixes the generators of its whole connected component. (And, maybe, 
depending on the signs of the edges of F, there is no correct assignation at 
all.) Thus, the number of degrees of freedom is at most equal to the number 
of connected components of F. 
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Thus (up to our approximation on the number of cyclically reduced 
words), the number of random assignments of cyclically reduced words to 
the vertices of T is (2m — 1)"^, whereas the number of those assignments 
satisfying the constraints of the edges is at most (2m — 1)*" where C is the 
number of connected components. Hence, the probability that a given as- 
signment of n random words to the vertices of T satifies the edges relations 
is at most (2m — 1)*^""^. 

This is the probability that n given relators of a random presentation ful- 
fill the diagram. Now there are (2m — l)*^^ relators in a random presentation, 
so the probability that we can find n of them fulfilling the diagram is at most 
(2m - 1)""'^ (2m - 1)^""^. 

Now let Ti be the subgraph of T made of those vertices corresponding to 
a face of D bearing a number ^ i. Thus Fi C r2 C . . . C r„ = F. Of course, 
the probability that F is fulfillable is less than any of the probabilities that 
Fj is fulfillable for i ^ n. 

The above argument on the number of connected components can be 
repeated for Fji the probability that Fj is fulfillable is at most (2m— 1)*'^^"'"'"*"*^ 
where Cj is the number of connected components of Fj. 

This leads to setting 

di = id£ + Ci-ii 

and following Gromov we interpret this number as the dimension of Fj, or, 
better, the dimension of the set of random presentations for which there exist 
i relators satisfying the conditions imposed by Fj. Thus: 

Fi{D is fulfillable) ^ (2m - l)*^' \/i 

Before concluding we need a further purely combinatorial lemma. 

Lemma 7 

\dD\^i\D\{l- 2d) + 2^dj(mj -mj+i) 

where rrii,! ^ i ^ n is the number of faces of D hearing relator number i. 

Before proving the lemma let us end the proof of the proposition. We are 
free to choose the order of the construction, and we may suppose that the 
mj's are non-increasing, i.e. that we began with the relator appearing the 
biggest number of times in D, etc., so that mj — mj+i is non-negative. 

If all (ij's are non-negative, then we have the isoperimetric inequality 
\dD\ ^ i\D\ (1 — 2d) and the proposition is true since the probability at play 
is at most 1. 

If some di is negative, we use the fact established above that the probabil- 
ity that the diagram is fulfillable is less than (2m— 1)™^'^\ As ^mj = \D\, we 
have^dj(mj-mj+i) ^ |D|infrfj. Thusinfdj^ {\dD\ - i\D\ {1 - 2d)) /2\D 
hence the proposition. □ 

Proof of the lemma - A vertex in the i-th part of F is thus of multi- 
plicity at most mj. Let A be the number of edges in F. We have 

\dD\ ^\D\i-2A = i^rrii - 2A 
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(where the equality \dD\ = \D\i — 2 A holds when D has no filaments). 

Thus we want to show that either the number of edges is small, or the ful- 
fillability probability is small. The latter grows with the number of connected 
components of F, so this looks reasonable. 

Let Ai be the number of edges in Fj. We now show that 



Depart from Fj and add the new vertices and edges of Fj+i. When adding 
the £ vertices, the number of connected components increases by i. So we 
only have to show that when adding the edges, the number of connected 
components decreases at least by 1/mj+i times the number of edges added. 

Call external point a point of Fj+i \ Fj which shares an edge with a point 
of Fj. Call internal point a point of Fj+i \ Fj which is not external. Call 
external edge an edge between an external point and a point of Fj, internal 
edge an edge between two internal points, and external-internal edge an edge 
between an external and internal point. Call true internal point a point 
which has at least one internal edge. 

While adding the external edges, each external point is connected to a 
connected component inside Fj, and thus the number of connected compo- 
nents decreases by 1 for each external point. 

Now add the internal edges (but not yet the external-internal ones): If 
there are true internal points, these make at most N/2 connected compo- 
nents after adding the internal edges, so the number of connected components 
has decreased by at least N/2. 

After adding the external-internal edges the number of connected compo- 
nents still decreases. Thus it has decreased by at least the number of external 
points plus half the number of true internal points. 

Now as each external point is of degree at most mj+i, the number of 
external plus external-internal edges is at most mj+i times the number of 
external points. If there are true internal points, the number of internal 
edges is at most Nmi+i/2 (each edge is counted 2 times). So the total number 
of edges is at most mj+i times the number of external points plus half the 
number of true internal points, which had to be shown. 

Thus we have proved that Aj+i — Aj+mj+i((ij+i — (ij) ^ nii^idi. Summing 
over i yields 



A. 



■i+l — 



Ai + mj+i(rfj+i - di) ^ mi+id£ 



or equivalently that 



Ai+i -A + mj+i(a+i - (C- + £)) ^ 




Thus 




^ i^mi-2di^m.i + 2^mi{di-di_i) 
= i\D\{l -2d) + 2^di{mi-mi+i) 
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as was needed. □ 

Corollary 8 - Let D be a davKD. Then, either D satisfies the isoperi- 
metric inequality 

\dD\ ^ i\D\ (1/2 -rf) 

or the probability that is can be fulfilled by relators of a random presentation 
is at most (2m - i)-^{i/2-rf)/2_ 

Hence the interest of taking d < 1/2... 

This was for a given davKd D. In order to show that the group is hyper- 
bolic, we have to show that the probability that there exists a davKd violating 
the isoperimetric inequality tends to when £ ^ oo. But here we use the 
local-global principle for hyperbolic grometry (or Cartan-Hadamard-Gromov 
theorem, see Appendix E} , which can be stated as: 

Proposition - For each a > 0, there exist an integer K{a) ^ 1 and an 
a' > such that, if a group is given by relations of length £ for some £ and 
if any reduced van Kampen diagram with at most K faces satifies 

\dD\ ^ a£\D\ 

then any reduced van Kampen diagram D satisfies 

\dD\ ^ a'£\D\ 

(hence the group is hyperbolic). 

Now take a = 1/2 — d and the K given by the proposition. If N{K, £) is 
the number of davKd's with at most K faces and each face has £ edges, then 
the probability that one of them is fulfillable and violates the isoperimetric 
inequality is at most N{K,£) (2m - i)-^(V2-rf)/2_ 

Proposition 9 - For fixed K, the number N{K, £) grows polynomially 
with£. Hence, the probability N [K , £) (2m — 1)"^^^/^"'^)/^ tends exponentially 
to as £ oo. 

Proof - Let us evaluate N{K,£). As the relators in the presentation are 
taken to be cyclically reduced, we only have to consider regular diagrams (see 
section HJ . A regular davKd is only a planar graph with some decoration on 
the edges, namely, a planar graph with on each edge a length indicating the 
number of edges of the davKd it represents, and with vertices of degree at 
least 3 (and, as in a davKd, every face is decorated with a starting point, 
an orientation, and a number between 1 and K). Let G{K) be the number 
of planar graphs with vertex degree at least 3. In such a graph there are 
(by Euler's formula) at most 3K edges, so there are at most £^^ choices of 
edge lengths, and we have {2£K)^ choices for the decoration of each face 
(orientation, starting point and number between 1 and K). 
So N{K,£) ^ G{K){2K)^£^^. □ 
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This proves that the quotient is hyperbolic; we now show that it is infinite. 
We can of course use the general argument of section 16.9.11 but there is a 
shorter proof in this case. First, as any reduced diagram satisfies \dD\ ^ 
a'E\D\ ^ the ball of radius a' 1/2 injects into the quotient, hence the 
quotient contains at least one non-trivial element and cannot be {e}. 

Second, we prove that the presentation is aspherical. With our con- 
ventions on van Kampen diagrams, our asphericity implies asphericity of the 
Cayley complex and thus cohomological dimension at most 2 (indeed, thanks 
to the marking of each face by a starting point and a relator number, two 
faces are reducible in a diagram only if they really are the same face in the 
Cayley complex, so that diagram reduction is a homotopy in the Cayley 
complex). This will end the proof: indeed, cohomological dimension at most 

2 implies torsion-freeness (see jBj, p. 187), hence the quotient cannot be a 
non-trivial finite group. 

Indeed, the isoperimetric inequality above is not only valid for minimal 
diagrams, but for any reduced diagram. Now suppose that there is some 
reduced spherical diagram. It will have zero boundary length and thus will 
violate any isoperimetric inequality, hence a contradiction. Thus the presen- 
tation is aspherical. 

This proves Theorem d 

3 Outline of the argument 

Here we explain some of the ideas of the proof of Theorems [2l El and HI 

We will give a general theorem for hyperbolicity of random quotients by 
words taken from some probability measures on the set of all words. We will 
need somewhat technical axioms on the measures (for example, that they 
weight only long words). Here we give a heuristic justification of why these 
axioms are needed. 

We proceed by showing that van Kampen diagrams of the quotient G/ {R) 
satisfy a linear isoperimetric inequality. 

If D is a van Kampen diagram of the quotient, let D' be the subcomplex 
of D made of relators of the presentation of G ("old relators") and D" the 
subcomplex made of relators in R ("new relators"). 

Say the new relators have length of order i where i is much bigger than 
the hyperbolicity constant of G. (This will be Axiom 1.) 

The main point will be that D' is a diagram in the hyperbolic group G, 
and, as such, is narrow (see Appendix IB|) . We show below that its narrowness 
is of order logi. Hence, if £ is big enough, the diagram D can be viewed as 
big faces representing the new relators, separated by a thin layer of "glue" 
representing the old relators. The "glue" itself may contain invaginations in 
the new relators and narrow excrescences on the boundary. 
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3.1 A basic picture 

As an example, let us study a basic picture consisting of two new relators 
separated by some old stuff. Say that two random new relators r, r' are 
"glued" along subwords of length L, L' (we may have L ^ L'). Let w be 
the word bordering the part of the diagram made of old relators, we have 
\w\ = L + L' + o{i). By construction, w; is a word representing the trivial 
element in G. Write w = xux'v where x is a subword of r of length L, x' is 
a subword of r' of length L', and u and v are short words. 




Let us evaluate the probability that such a diagram exists. Take two 
given random relators r, r' in R. The probability that they can be glued 
along subwords x, x' of lengths L, L' by narrow glue in G is the probability 
that there exist short words m, v such that xux'v = e m G. 

If, as in the standard case, there were no glue (no old relators) and r and 
r' were uniformly chosen random reduced words, the probability that r and 
r' could be glued along subwords x, x' of length L (we would have L = L' in 
this case) would be (2m — 1)~^. But we now have to consider the case when 
then X and x' are equal, not as words, but as elements of G (and up to small 
words u and w, which we will neglect). 

If, for example, the relators are uniformly chosen random words, then 
X and x' are independent subwords, and the probability that x and x' are 
(almost) equal in G is the probability that xx'~^ = e; but xx'~^ is a uni- 
formly chosen random word of length L + L' , and by definition the proba- 
bility that it is equal to e is controlled by the gross cogrowth of G: this is 
roughly (2m)~^^^^^^^~^^'^ (recall the alternate definition of gross cogrowth in 
section . 

In order to deal not only with uniformly chosen random words but with 
other situations such as random geodesic words, we will need a control on 
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the probability that two relators can be glued (modulo G) along subwords 
of length L and L'. This will be our Axiom 3: we will ask this probability 
to decrease like (2m) "^^^"''^ ^ for some exponent /5 (equal to 1 — 6^ for plain 
random words). 

Now in the simple situation with two relators depicted above, the length 
of the boundary of the diagram is not exactly 2i — L — L' , since there can be 
invaginations of the relators, i.e. long part of the relators which are equal to 
short elements in G (as in the left part of the picture above). In the case of 
uniformly chosen random relators, by definition the probability that a part 
of length L of a relator is (nearly) equal to e in G is roughly {2m)~^^~^^^ . So, 
again inspired by this case, we will ask for an axiom controlling the length 
of subwords of our relators. This will be our Axiom 2. 

Axiom 4 will deal with the special case when r = r'~^, so that the words x 
and x' above are equal, and not at all chosen independently as we implicitly 
assumed above. In this case, the size of centralizers of torsion elements in 
the group will matter. 

This was for given r and r'. But there are (2m) ''^ relators in R, so 
we have {2mY'^ choices for r, r'. Thus, the probability that in R, there 
are two new relators that glue along subwords of length L, L' is less than 
(2m)2'^^(2m)-^(^+^'). 

Now, just observe that the length of the boundary of the diagram is (up 
to the small words u and v) 2i — L — L' . On the other hand, when d < P, the 
exponent 2di — [3{L + L') of the above probability will be negative as soon 
as L + L' is bigger than 21. This is exactly what we want to prove: either 
the boundary is big, or the probability of existence of the diagram is small. 

This is comparable to the former situation with random quotients of the 
free group: in the free group, imposing two random relators to glue along 
subwords of lengths L and L' = L results in L "equations" on the letters. 
Similarly, in the case of plain random words, in a group of gross cogrowth 9, 
imposing two random words to glue along subwords of lengths L, L' results 
in P{L + L') "equations" on these random words, with [3 = 1 — 6. 

Now for diagrams having more than two new relators, essentially the 
number of "equations" imposed by the gluings is (3 times the total internal 
length of the relators. The boundary is the external length. If there are n 
new relators and the total internal length is A, then the boundary is roughly 
ni — A. But the probability of existence of such a diagram is (2m)-^^(2m)'^'^^ 
where the last factor accounts for the choice of the n relators among the 
{2mY^ relators of R. So if d < /?, as soon as A > nld/(3, the probability 
decreases exponentially with otherwise, the boundary is longer than n^(l — 
d/P). 

3.2 Foretaste of the Axioms 

As suggested by the above basic picture, we will demand four axioms: one 
saying that our random relators are of length roughly £, another saying that 
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subwords of our relators are not too short, another one controlling the prob- 
ability that two relators glue along long subwords (that is, the probability 
that these subwords are nearly equal in G), and a last one controlling the 
probability that a relator glues along its own inverse. 

As all our estimates are asymptotic in the length of the words considered, 
we will be allowed to apply them only to sufficiently long subwords of our 
relators (and not to one individual letter, for example), that is, to words of 
length at least e£ for some e. 

Note that in order to be allowed to apply these axioms to any subword of 
the relators at play, whatever happens elsewhere, we will need to ask that dif- 
ferent subwords of our relators behave quite independently from each other; 
in our axioms this will result in demanding that the probability estimates 
hold for a subword of a relator conditionnally to whatever the rest of the 
relator is. 

This is a strong independence condition, but, surprisingly enough, is it 
valid not only for uniformly chosen random words (where by definition every- 
thing is independent, in any group), but also for randomly chosen geodesic 
words. This is a specific property of hyperbolic groups. 

Several exponents will appear in the axioms. As we saw in the basic 
picture, the maximal density up to which the quotient is non-trivial is exactly 
the minimum of these exponents. Back to the intuition behind the density 
model of a random quotient (see the introduction), the exponents in our 
axioms indicate how many equations it takes in G to have certain gluings 
in our relators, whereas the density of the random quotient is a measure of 
how many equations we can reasonably impose so that it is still possible to 
find a relator satisfying them among our randomly chosen relators. So this 
intuition gets a very precise numerical meaning. 

4 Axioms on random words implying hyperbol- 
icity of a random quotient, and statement of 
the main theorem 

We want to study random quotients of a (non-elementary) hyperbolic group 
G by randomly chosen elements. Let jj^i be the law, indexed by some param- 
eter i to tend to infinity, of the random elements considered. 

We will always assume that fi£ is a symmetric measure, i.e. for any x G G, 
we have fie{x) = fii{x~^). 

We will show that if the measure satisfies some simple axioms, then the 
random quotient by elements picked under the measure is hyperbolic. 

For each of the elements of G weighted bu fii, fix once and for all a 
representation of it as a word (and choose inverse words for inverse elements), 
so that fie can be considered as a measure on words. Satisfaction of our 
axioms may depend on such a choice. 



22 



Let /if" be the law /i^ restricted (and rescaled) to words of length L (or 
if there are no such words in the support of //). In most applications, (ii will 
weight only words of length £, but we will occasionally use laws /i^ weighting 
words of length comprised between, say, A^ and Bi. 

To pick a random set R of density at most d is to pick, for each length L, 
independently, at most (2m) '^^ random words of length L according to law 
H^. That is, for each length, the density is at most d. 

(We say "at most" because we do not require that exactly (2m) ''^ words 
of length L are taken for each L. Taking smaller R will result in a hyperbolic 
quotient as well.) 

We want to show that if d is less than some quantity depending on //^ 
(and G, since /i^ takes value in G), then the random quotient G/ {R) is very 
probably non-elementary hyperbolic. 

4.1 Asymptotic notations 

By the notation f{i) ^ g{£) we shall mean that 

1 1 
lim -log/(£) = lim ^log^(^) 

I— too I £— »oo I 

We define the notation f{i) < g{i) similarly. We will say, respectively, 
that / is roughly equal or roughly less than g. 

Accordingly, we will say that f{i,L) ^ g{^,L) uniformly for all L ^ £ if 
whatever the sequence L{i) ^ i is, we have 

lim ]\ogf{i,L{i)) = lim ]hgg{i,L{i)) 

i—>-oo I >oo I 

and if this limit is uniform in the sequence L{i). 

4.2 Some vocabulary 

Here we give technical definitions designed in such a manner that the axioms 
can be stated in a natural way. We recommend to look at the axioms first. 

Let a; be a word. For each a, b in [0; 1] such that a + 6 ^ 1, we denote by 
Xa;b the subword of x going from the (a |x|)-th letter (taking integer part, and 
inclusively) to the {{a + b) |x|)-th letter (taking integer part, and exclusively), 
so that a indicates the position of the subword, and b its length. If a + 6 > 1 
we cycle around x. 

Definition 10 - Let Pi be a family of properties of words, indexed by 
the integer i. We say that 

for any subword x under fi£, Pr(P^(a;)) < p{i) 

if for any a, 6 G [0; 1], 6 > 0, whenever we pick a word x according to fi£ we 
have 

Pr (PiiXa-b) \\x\, Xo-a) < p{i) if a + b ^ 1 
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or 

Pr {Pe{xa-b) \ \x\, Xa+b-i-a) < p(^) if a + b > 1 

and if moreover the constants implied in < are uniform in a, and, for each 
e > 0, uniform when b ranges in the interval [e; 1]. 

That is, we pick a subword of a given length and ask the probability 
to be bounded independently of whatever happened in the word up to this 
subword (if the subword cycles around the end of the word, we condition by 
everything not in the subword). 

We also have to condition w.r.t. the length of the word since in the defi- 
nition of a random set of density d under /i^ above, we made a sampling for 
each length separately. 

It would not be reasonable to ask that the constants be independent of 
b for arbitrarily small b. For example, if Hi consists in choosing uniformly a 
word of length £, then taking b = 1/i amounts to considering subwords of 
length 1, which we are unable to say anything interesting about. 

We give a similar definition for properties depending on two words, but 
we have to beware the case when they are subwords of the same word. 

Definition 11 - Let Pi be a family of properties depending on two words, 
indexed by the integer i. We say that 

for any two disjoint subwords x, y under fii, PY{Pe{x, y)) < p{t) 

if for any a, 6, a', b' G [0; 1] such that 6 > 0, 6' > 0, a + 6 ^ 1, a' + 6' ^ 1, 
whenever we pick two independent words x, x' according to we have 

Pr {Pl{Xa-b, I \x\ , \x'\ , Xo;a, x'^.^,) < p{i) 

and if for any a, b, a', b' G [0; 1] such that a < a + b ^ a' < a' + b' ^ 1, 
whenever we pick a word x according to we have 

Pr {Pi{Xa,b, Xa'-b') \ \x\, \x'\ , XQ-a, Xa+b;a') < 

We give similar dehnitions when a + b > 1 or a' + b' > 1, conditioning by 
every subword not in Xa;b or x'^,.y. 

Furthermore, we demand that the constants implied in < be uniform in 
a, a', and, for each e > 0, uniform when b, b' range in the interval [e; 1]. 

We are now ready to express the axioms we need on our random words. 
4.3 The Axioms 

Our first axiom states that fii consists of words of length roughly i up to 
some constant factor. This is crucial for the hyperbolic local-global principle 
(Appendix EJ. 

Axiom 1 - There is a constant Ki such that fi£ weights only words of length 
between i/ ni and nii. 
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Note this axiom applies to words picked under /i^, and not especially 
subwords, so it does not rely on our definitions above. But of course, if 
^ Kii, then \xa-b\ ^ bnii. 

Our second axiom states that subwords do not probably represent short 
elements of the group. 

Axiom 2 - There are constants K2, (32 such that for any subword x under 
111, for any t ^1, we have 

Vi{\\x\\ ^ K2\x\{l-t)) < (2m)-^2*l"l 

uniformly in t. 

Our next axiom controls the probability that two subwords are almost 
inverse in the group. We will generally apply it with n{i) = 0(log£). 

Axiom 3 - There are constants and 73 such that for any function n = 
n{i), for any two disjoint subwords x,y under ni, the probabiUty that there 
exist words u and v of length at most n, such that xuyv = e in G, is roughly 
less than (2m)^3n(^2m)-^3(|x|+|y|)_ 

Our last axiom deals with algebraic properties of commutation with short 
words. 

Axiom 4 - There exist constants and 74 such that, for any function 
n = n{£), for any subword x under fii, the probability that there exist words 
u and V of length at most n, such that ux = xv and u ^ e, v ^ e, is roughly 
less than (2m)^"(2m)-^4|x| 

If G has big centralizers, this axiom will probably fail to be true. We will 
see below (section H75|l that, in a hyperbolic group with "strongly harmless" 
torsion, the algebraic Axiom 4 is a consequence of Axioms 1 and 3 combined 
with a more geometric axiom which we state now. 

Axiom 4' - There are constants P4' and 'j4^' such that, for any C > 0, for 
any function n = n{£), for any subword x under fie, the probability that there 
exists a word u of length at most n such that some cyclic permutation x' of 
xu satisfies \\x'\\ ^ C\og(., is roughly less than (2m)'^4'"(2m)~'^4'|a;|_ 

Remark 12 - Let /i^ be a family of measures such that /i^ < /i^. As our 
axioms consist only in rough upper bounds, if the family ji£ satisfy them, 
then so does the family /i^. 

Note that as we condition every subword by whatever happened before 
(i.e. by what the rest of the word is up to the position of the subword), our 
axioms imply that subwords at different places are essentially independent. 
This is of course true of plain random words, but also of geodesic words and 
reduced words as we will see below. 

In |Gro4| . p. 139-141, M. Gromov uses similar-looking properties. His 
pri is similar to our Axiom 2, and his prs controls the same kind of event as 
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our Axiom 4. We no not use any analogue of his pr2, and analogues of our 
Axioms 1 and 3 are indeed present in |Gro4| but in a more "diffuse" way in the 
paper. Also note that in |Gro4j emphasis is put on very small densities, so 
that the properties considered therein are of the form "such event is realized 
with probability exponentially close to 1", whereas since we work in large 
densities we have to get a precise control of the tails of the distributions, and 
so our axioms take the form "the probability of a deviation of size L from 
such event is at most exp(— with a tight value of (3 needed. So our 
axioms (which have been found independently of |Gro4| ) are more precise 
quantitatively. 

4.4 The Theorem 

Our main tool is the following 

Theorem 13 - Let G he a non-elementary hyperbolic group with trivial 
virtual centre. Let fie be a family of symmetric measures indexed by i, 
satisfying Axioms 1, 2, 3 and 4. Let R be a set of random words of density 
at most d picked under fif . 

If d < min{i32, Ps, Pi) , then with probability exponentially close to 1 as 
i —>■ oo, the random quotient G/ {R) is non-elementary hyperbolic, as well as 
all the intermediate quotients G/ {R') with R! C R. 

Section El is devoted to the proof. 

Remark 14 - Remark El tells that if the theorem applies to some family 
of measures it applies as well to any family of measures fi'i ^ fie- 

4.5 On torsion and Axiom 4 

We show here that in a hyperbolic group with "harmless" torsion, Axioms 1, 
3 and 4' imply Axiom 4. The proof makes the algebraic nature of this axiom 
clear: in a hyperbolic group, it means that subwords under jj^e are probably 
not torsion elements, neither elements commuting with torsion elements, nor 
close to powers of short elements. 

Recall that the virtual centre of a hyperbolic group is the set of elements 
whose action on the boundary at infinity is trivial. For basic properties 
see (DEI. 

Definition 15 (Harmless torsion) 

A torsion element in a hyperbolic group is said to be strongly harmless if 
its centralizer is either finite or virtually Z. 

A torsion element is said to be harmless if it is either strongly harmless 
or lying in the virtual centre. 

A hyperbolic group is said to be with (strongly) harmless torsion if each 
non-trivial torsion element is (strongly) harmless. 

Harmfulness is defined as the opposite of harmlessness. 
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For example, torsion-free groups are with harmless torsion, as well as free 
products of free groups and finite groups. Strongly harmless torsion is stable 
by free product, but harmless torsion is not. 

Let Hi be a measure satisfying Axioms 1, 3 and 4'. 

Proposition 16 - The probability that, for a subword x under fie, there 
exists a word u of length at most n = n{i) such that xu is a torsion element, 
is roughly less than (2m)'^4'"(^2m)~^4'|a:|_ 

Proof - In a hyperbolic group, there are only finitely many conjugacy 
classes of torsion elements (see |GH] . p. 73). Let L be the maximal length of 
a shortest element of a conjugacy class of torsion elements, we have L < oo. 
Now every torsion element is conjugated to an element of length at most L. 

Suppose xu is a torsion element. It follows from Corollary [Hi] that some 
cyclic permutation of it is conjugate to an element of length at most L by 
some word of length at most 51og2 \ xu\ + C'^ + 1 where C'^ is a constant 
depending on the group. In particular, this cyclic conjugate has norm at 
most L + 2(51og2 \ xu\ + + 1). 

Suppose, by Axiom 1, that |x| ^ Kii. 

There are ^ Kii + n cyclic conjugates of xu. The choice of the cyclic 
conjugate therefore only introduces a polynomial factor in i. Let x' denote 
the cyclic conjugate of xu at play. 

Thus we have to evaluate the probability that ^ L + 2(51og2 + 
C'^ + 1). As L and C'^ are mere constants. Axiom 4' precisely says that this 
probability is roug hly less than (2m)T4'"(2m)-^4'l^l. □ 

Proposition 17 - Let w e G. For any subword x under the prob- 
ability that X = w in G is roughly less than (2m) "^^'^^l (uniformly in w). 

Proof - Suppose that the probability that a subword x under fie is equal 
to w is equal to p. Then, by symmetry, the probability that an independent 
disjoint subword y with \y\ = \x\ is equal to is equal to p as well. So 
the probability that two disjoint subwords x and y are inverse is at least p'^. 
But Axiom 3 tells (taking u = v = e) that this probability is roughly at most 
(2m)-^3(l^l+|y|) = (2m)-2^3l^l^ hence p < (2m)-^3l^l □ 

Proposition 18 - Suppose G has strongly harmless torsion, and that 
Axioms 1, 3 and 4' are satisfied. Set /? = min(/53, P^r). 

There is a constant 7 such that for any subword x under fie, the probabil- 
ity that there exist words u, v of length at most n = n{€), such that ux = xv 
in G, with u,v not equal to e, is roughly less than (2m)'''"~^I^L 

So Axiom 4 is satisfied with (3^ = min(/33, /?4/). 

Proof - Denote by x again a geodesic word equal to x in G. 

The words u and v are conjugate (by x), and are of length at most n. 
After Corollary IHI] they are conjugate by a word w of length at most Gn 
where C is a constant depending only on G. 
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Let us draw the hyperbolic quadrilateral xwuw x u . This is a com- 
mutation diagram between xw and u. 




The word xw may or may not be a torsion element. The probability 
that there exists a word w of length at most Cn, such that xw is a torsion 
element, is roughly less than (2m)'^4'C"-/5|^'l Proposition El In this case 
we conclude. 

Now suppose that xw is not a torsion element. Then we can glue the 
above diagram to copies of itself along their w-sides. This way we get two 
quasi-geodesics labelled by {{xw)")n£z that stay at finite distance from each 
other. The element u acting on the first quasi-geodesic gives the second one. 

These two quasi-geodesics define an element x in the boundary of G. This 
element is of course stabilized by xw, but it is stabilized by u as well. This 
means that either u is a hyperbolic element, or (by strong harmlessness) that 
u is a torsion element with virtually cyclic centralizer. 

The idea is that in this situation, xw will lie close to some geodesic A 
depending only on the short element u. As there are not many such A's (and 
as the probability for a random word to be close to a given geodesic behaves 
roughly like the probability to be close to the origin), this will be unlikely. 

First, suppose that u is hyperbolic. Let us use the same trick as above 
with the roles of xw and u exchanged: glue the diagram above to copies of 
itself by the {xw)-side. This defines two quasi-geodesics labelled by {u'^)nez, 
one of which goes to the other when acted upon by xw. 

Namely, let A be a geodesic equivalent to (u"), and set A' = xwA. As xw 
stabilizes the limit of A, A' is equivalent to A. But two equivalent geodesies 
in a hyperbolic group stay at Hausdorff distance at most -Ri where Ri is a 
constant depending only on the group (see |GH| . p. 119). 

The distance from xw to A' is equal to the distance from e to A. By 
Proposition in21 applied to = e, this distance is at most |m| + R2 where 
i?2 is a constant depending only on G. Hence the distance from xw to A is 
at most \u\ + R with R = Ri + R2. Let ?/ be a point on A realizing this 
distance. As \xw\ ^ |x| + \w\, we have \y\ ^ |a;| + |w| + |u| + R. There are 
at most 2 |a;| + 2 \w\ + 2 |n| + 2R + 1 such possible points on A (since A is a 
geodesic). For each of these points, the probability that x falls within distance 
of it is roughly less than (2m)l"l+^"''l'"l(2m)~^l^l by Proposition^ 
applied to all of these points. So the probability that x falls within distance 
less than |n| + -R+ of any one of the possible y's on a given geodesic A is 
roughly less than (2 |x| + 2 + 2 \u\ + 2i? + l)(2m)l"l+^+l'"l(2m)-^l^l which 
in turn is roughly less than (2m)'""~^l'^l as \ w\ ^ Cn and i? is a constant. 

This was for one fixed u. But each different u defines a different A. There 
are at most (2m) I"' ^ (2m)" possibilities for u. Finally, the probability that 
X falls within distance i? + |w| of any one of the geodesies defined by these 
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n's is less than (2m) "+'^""'^1'^' as was to be shown. Thus we can conclude 
when u is hyperbolic. 

Second, if m is a torsion element with virtually cyclic centralizer Z, we 
use a similar argument. Let L as above be the maximal length of a shortest 
element of a conjucacy class of a torsion element. By Proposition IHIH u is 
conjugate to some torsion element u' of length at most L by a conjugating 
word V with |f | ^ \u\ /2 + Ri where Ri is a constant. The centralizer of u' 
is Z' = vZv~^. We know that xw G Z. 

There are two subcases: either Z is finite or Z is virtually Z. 

Let us begin with the former. If Z is finite, let ||Z|| be the maximal norm 
of an element in Z. We have ||Z|| ^ 2 \v\ + \\Z'\\. Let R2 = max \\Z'\\ when u' 
runs through all torsion elements of norm at most L. As xw lies in Z we have 
^ + \\Z\\ ^ + 2 \v\ + R2 ^ \ w\ + \u\ + 2Ri + R2. So by Proposi- 
tion[T3the probability of this event is roughly less than (2m) I"''"'" '"'+^^1+^2 < 
(2m)*-^""'"" as \w\ ^ Cn and as Ri,R2 are mere constants. 

Now if Z is virtually Z, let A be a geodesic joining the two limit points 
of Z. The element u' defined above stabilizes the endpoints of the geodesic 
vA, and so does vxwv~^. 

By Corollary inn vxwv^^ lies at distance at most R{vA) from vA. As 
there are only a finite number of torsion elements u' with ^ L, the 

supremum R of the associated R{vA) is finite, and so, independently of u, 
the distance between vxwv~^ and vA is at most R. 

Now dist(xw,A) ^ \v\ + dist{xwv~^, A) = \v\ + dist{vxwv~^,vA) ^ 
|w| +-R and we conclude exactly as in the case when u was hyperbolic, using 
that \v\ ^ \u\ /2 + Ri. This ends the proof in case u is a torsion element 
with virtually cyclic centralizer. □ 

5 Applications of the main theorem 

We now show how Theorem fT3l leads, with some more work, to the theorems 
on random quotients by plain words, reduced words and geodesic words given 
in the introduction. 

We have three things to prove: 

• first, that these three models of a random quotient satisfy our axioms 
with the right critical densities; 

• second, as Theorem fT3l onlv applies to hyperbolic groups with strongly 
harmless torsion (instead of harmless torsion) , we have to find a way 
to get rid of the virtual centre; 

• third, we have to prove triviality for densities above the critical one. 
Once this is done. Theorems El El and IH will be proven. 

We will have to work differently if we consider quotients by plain random 
words, by random reduced words or by random geodesic words. 
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For instance, satisfaction of the axioms is very different for plain words 
and for geodesic words, because in plain random words, two given subwords 
fo the same word are chosen independently, which is not the case at all a 
priori for a geodesic word. 

Furthermore, proving triviality of a quotient involves small scale phenom- 
ena, which are very different in our three models of random words (think of 
a random quotient of Z by random words of £ letters ±1 or by elements of 
size exactly £). 

These are the reasons why the next three sections are divided in cases, and 
why we did not include these properties in a general and technical theorem 
such as Theorem fT!^ 

Note that it is natural to express the critical densities in terms of the i-th 
root of the total number of words of the kind considered, that is, in base 2m 
for plain words, 2m — 1 for reduced words and (2m)^ for geodesic words. 

5.1 Satisfaction of the axioms 
5.1.1 The case of plain random words 

We now take as our measure for random words the uniform measure on all 
words of length £. Axiom 1 is satisfied by definition. 

In this section, we denote by Bi (as "Brownian") a random word of length 
i uniformly chosen among all (2m)^ possible words. 

Recall 9 is the gross cogrowth of the group, that is, the number of words 
of length i which are equal to e in the group is roughly (2m) for even i. 

Recall the alternate definition of gross cogrowth given in the introduction: 
the exponent of return to e of the random walk in G is 1 — ^. This is at the 
heart of what follows. 

We will show that 

Proposition 19 - Axioms 1, 2, 3, 4' are satisfied by plain random uni- 
formly chosen words, with exponent 1 — 9 (in base 2m). 

By definition, disjoint subwords of a uniformly taken random word are in- 
dependent. So we do not have to care at all with the conditional probabilities 
of the axioms (contrary to the case of geodesic words below). Conditionnally 
to anything else, every subword x follows the law of B\x\. 

The definition of gross cogrowth only applies to even lengths. If i is odd, 
either there are some relations of odd length in the presentation of the group, 
and then the limits holds, or there are no such relations, and the number of 
words of length i equal to e is zero. In any case, this number is < (2m)^^. 

This is a delicate (but irrelevant) technical point: We should care with 
parity of the length of words. If there are some relations of odd length in our 
group, then the limit in the definition of gross cogrowth is valid regardless of 
parity of but in general this is not the case (as is examplified by the free 
group). In order to get valid results for any length, we therefore often have 
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to replace a ~ sign with a < one. In many cases, our statements of the form 
"Pr(. . .) < /(^)" could in fact be replaced by "Pr(. . .) !^ /(£) if £ is even or 
if there are relations of odd length, and Pr(. . .) = otherwise". Here is the 
first example of such a situation. 

Proposition 20 - The probability that Bi is equal to e is roughly less 
than (2m) -(1-^)^ 

Proof - Alternate definition. □ 
Proposition 21 

Pr(||i?,|| ^ f ) < (2m)"(^-^K^-T^^') 

uniformly in i' ^ £. 

In particular, the escaping speed is at least So Axiom 2 is satisfied 
with K2 = ^ and p2 = l- 0. 

Proof - For any L between and t, we have that 

FriBe+L = e) ^ (2m)-^ Pr(||5,|| = L) 

But Fi{B£^L = < (2m)~^^~^^^^'^^'> (and this is uniform in L ^ i since 
in any case, £ + L is at least equal to £), hence the evaluation for a given L. 

Now, summing over L between and i' introduces only a subexponential 
factor in i. □ 

Proposition 22 - The probability that, for two independently chosen 
words Bi and -B^,, there exist words u and v of length at most n = n{i), such 
that BiuB[,v = e in G, is roughly less than (2m)(2+2e)n(2m)-(i-^)(^+^'). 
That is, Axiom 3 is satisfied with exponent 1 — 9. 

Proof - For any word u, we have Pr(i?|„| = u) ^ (2m)~l"l. 

So let u and v be any two fixed words of length at most n. We have 

Pr(5,+|„|+,,+|,| = e) ^ (2m)-l"l-l^l Pr(fi,nB> = e) 

We know that FT{Be+\u\+i'+\v\ = e) < (2m)-(^-^)(^+l"l+^'+l^l). 
So FT{BeuB'^,v = e) < (2m)^(l"l+l^l)(2m)-(i-^)(^+^'). 
Now there are (2m)l"l"'"l^l choices for u and v. □ 

Proposition 23 - The probability that there exists a word u of length at 
most n = n{i), such that some cyclic conjugate of Biu is of norm less than 
Clogi, is roughly less than (2m)(i+^)"(2m)-(i-^)^ 
So Axiom 4' is satisfied with exponent 1 — 9. 

Proof - As above, for any word u, we have Pr(i?|^,| = u) ^ (2m)~l"L So 
any property of B£U occurring with some probability will occur for -B^+|m| 
with at least (2m)~''^l times this probability. We now work with Bi^\u\- 

Any cyclic conjugate of a uniformly chosen random word is itself a uni- 
formly chosen random word, so we can assume that the cyclic conjugate at 
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play is -B^+|m| itself. There are £ + |u| cyclic conjugates, so the choice of the 
cyclic conjugate only introduces a subexponential factor in (. and 

But we just saw above in Propositionl2T]that the probability that || ^ 

L is roughly less than (2m)-(^-^Kl«l+^-i^^) . 

Summing over the (2m)'"l choices for u yields the desired result, taking 
L = Clogi. □ 

So plain random words satisfy our axioms. 



5.1.2 The case of random geodesic words 

The case of geodesic words is a little bit more clever, as subwords of a geodesic 
word are not a priori independent. 

For each element x G G such that = fix once and for all a repre- 
sentation of X by a word of length i. We are going to prove that when fii is 
the uniform law on the sphere of radius £ in G, Axioms 1-4' are satisfied. 

Recall that g is the growth of the group: by definition, the number of 
elements of length £ in G is roughly (2m) As G is non-elementary we have 
g > (otherwise there is nothing to prove). 

Proposition 24 - Axioms 1, 2, 3, 4' are satisfied by random uniformly 
chosen elements of norm i, with exponent 1/2 (in base (2m)^j. 

Our proofs also work if /i^ is the uniform measure on the spheres of radius 
between i — L and £ + L for any fixed L. We will use this property later. 

Note that Axioms 1 and 2 are trivially satisfied for geodesic words, with 
Ki = K2 = I and P2 = 00. 

The main obstacle is that two given subwords of a geodesic word are not 
independent. We are going to replace the model of randomly chosen elements 
of length i by another model with more independence, and prove that these 
two models are roughly equivalent. 

Let Xi denote a random uniformly chosen element on the sphere of radius 
i in G. For any x on this sphere, we have Pr^Xi = x) ^ (2m)~^^. 

Note that for any 5 > 0, for any ei ^ L ^ £ the rough evaluation of the 
number of points of length L by (2m)^^ can by taken uniform for L in this 
interval (take i so that si is big enough). 

First, we will change a little bit the model of random geodesic words. The 
axioms above use a strong independence property of subwords of the words 
taken. This independence is not immediately satisfied for subwords of a given 
random geodesic word (for example, in the hyperbolic group F2 x Z/2Z, the 
occurrence of a generator of order 2 somewhere prevents it from occurring 
anywhere else in a geodesic word) . So we will cheat and consider an alternate 
model of random geodesic words. 

For a given integer A^, let be the product of A^ random uniformly 
chosen geodesic words of length i/N. We will compare the law of Xi to the 
law of X^. 
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Let X G G such that = £. We have Pr(X^ = x) ^ {2m)~3^. Let 
X = X1X2 . . .xn where each Xi is of length i/N. The probability that the 
i-th segment of is equal to Xj is roughly (2m) Multiplying, we get 
Pr(Xf = x)^ (2m) 

Thus, if P is a property of words, we have for any given that 

Pr(P(X,)) < Pr(P(Xf )) 

(The converse inequality is false as the range of values of X^^ is not 
contained in that of X^.) 

Of course, the constants implied in < depend on A^. We are stating that 
for any fixed A^, when i tends to infinity the law of the product of words 
of length i/N encompasses the law of Xi, and not that for a given i, when 
A^ tends to infinity the law of A^ words of length i is close to the law of a 
word of length Ni, which is false. 

We are going to prove the axioms for X^^ instead of Xe. As the axioms all 
state that the probability of some property is roughly less than something, 
these evaluations will be valid for Xi. 

The A^ to use will depend on the length of the subword at play in the 
axioms. With notations as above, if Xa-^b is a subword of length M of Xi, we 
will choose an A^ such that £/N is small compared to bi, so that Xa^b can be 
considered the product of a large number of independently randomly chosen 
smaller geodesic words. This is fine as our axioms precisely do not require 
the evaluations to be uniform when the relative length b tends to 0. 

First, we need to study multiplication by a random geodesic word. 

Let {x\y) denote the Gromov product of two elements x,y E G. That is, 

{x\y) = \ (||x|| + ||y|| - ||x"^y||). 

Proposition 25 - Let x g G and L ^ We iiave 

Pr((x|X,)^L)<(2m)-5^ 
uniformly in x and L ^ i. 

Proof - Let y be the point at distance L on a geodesic joining e to x. By 
the triangle-tripod transformation in exXi, the inequality (x|X^) ^ L means 
that Xi is at distance at most i — L + 46 from y. There are roughly at most 
(2m)^*^^~^+^'^^ such points. Thus, the probability that Xi is equal to one of 
them is roughly less than (2m)f'^^~'^+^'^)~^^ ^ (2m) 

Let us show that this evaluation can be taken uniform in L ^ ^. The 
problem comes from the evaluation of the number of points at distance at 
most i — L + 4:6 from y by (2m)^*^^~'^+^'^): when i — L + AS is not large enough, 
this cannot be taken uniform. So take some e > and first suppose that 
L ^ (1 - e)i, so that i - L + A6 ^ e'i for some e' > 0. The evaluation of the 
number of points at distance at most £ — L + 46 from y by (2m)^(^~^"'"'^'^) can 
thus be taken uniform in L in this interval. 
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Second, let us suppose that L ^ (1 — e)i. Apply the trivial estimate that 
the number of points at distance i — L + 46 ^ ei + 46 from y is less than 
{2mY^~^'^^ . The probability that Xi is equal to one of them is roughly less 
than {2mY^~^^ ^ (2m) "(^"'^^^ uniformly for these values of L. 

So for any e, we can show that for any L ^ £, the probability at play is 
uniformly roughly less than (2m)~*^^~^)^. Writing out the definition shows 
that this exacly says that our probability is less than (2m) uniformly in 
L. □ 

Corollary 26 - Let x e G and L ^ 2i. Then 

Ft i\\xXe\\ ^ \\x\\ + £ - L) < {2m)-^^/'^ 

and 

Pr (llX^xll ^ ||x|| + i-L) < (2m)-'^^/2 
uniformly in x and L. 

Proof - Note that the second case follows from the first one applied to x~^ 
and and symmetry of the law of Xi. 

For the first case, apply Proposition [2^1 to Xi and x~^ and write out the 
definition of the Gromov product. □ 

Proposition 27 - For any fixed N, uniformly for any x G G and any 

L ^ 2£ we have 

Pr {\\xX^\\ ^ \\x\\ +£-L)< (2m)~3^/2 

and 

Pr {\\X^x\\ ^ \\x\\ +£-L) < (2m)-3^/2 

Proof - Again, note that the second inequality follows from the first one 
by taking inverses and using symmetry of the law of X^ . 

Suppose [ I 1 1 ^ L. Let Xi, X2, a: at be iV random uniformly 

chosen geodesic words of length i/N. Let Li ^ 2i/N such that . . . Xi\\ = 
\\ + i/N — Li. Bj N applications of Corollary OEl the probability 
of such an event is roughly less than (2m)~^^^ But-we have ^ L. 

Now the number of choices for the Lj's is at most (2£)^, which is polynomial 
in £, hence the proposition. □ 

Of course, this is not uniform in A^. 

We now turn to satisfaction of Axioms 3 and 4' (1 and 2 being trivially 
satisfied). We work under the model of X^ . Let a; be a subword of X^. 
By taking A^ large enough (depending on \x\/i), we can suppose that x 
begins and ends on a multiple of i/N . If not, throw away an initial and final 
subword of x of length at most £/N . In the estimates, this will change ||x|| 
in ||x|| — 2i/N and, if the estimate to prove is of the form (2m)~^ll'^ll, for each 
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e > we can find an such that we can prove the estimate (2m)~^(^~^)l'^IL 
Now if something is roughly less than (2m)~^'^^~^)ll^ll for every e > 0, it is by 
definition roughly less than (2??7)~^ll^'l'. 

Note that taking N depending on the relative length \ x\ /i of the subword 
is correct since we did not ask the estimates to be uniform in this ratio. 

The main advantage of this model is that now, the law of a subword is 
independent of the law of the rest of the word, so we do not have to care 
about the conditional probabilities in the axioms. 

Proposition 28 - Axiom 3 is satisfied for random geodesic words, witli 
exponent g/2. 

Proof - Let x and y be subwords. The word x is a product of N \ x\ /£ 
geodesic words of length i/N, and the same holds for y. Now take two fixed 
words u, V, and let us evaluate the probability that xuyv = e. 

Fix some L ^ i, and suppose ||a;|| = L. By Proposition [23 starting at e, 
this occurs with probability (2m)~^(l'^l~^)/^. Now we have ^ L — 

but = By Proposition l27l starting at xu this occurs with 

probability (2m)~siL~\\u\\+\yhM)/^ _ 

So the total probability is at most the number of choices for u times the 
number of choices for L times (2m)"3(l^l"'^)/^ times (2m)"^'(^~ll"ll+IJ'l~ll''ll)/2. 
Hence the proposition. □ 

Proposition 29 - Axiom 4' is satisfied for random geodesic words, witli 
exponent g/2. 

Proof - Taking notations as in the definitions, let a: be a subword of 
of length M with 6^1. The law of x is X^^ . 

Note that applying Proposition [23 starting with the neutral element e 
shows that Pr(||x|| ^ L) < (2m)-»(l^l-^)/2_ 

Fix a M of length at most n and consider a cyclic conjugate y of xu. 

First, suppose that the cutting made in xu to get the cyclic conjugate 
y was made in u, so that y = u"xu' with u = u'u" . In this case, we have 

\\y\\ ^ ll^ll ~ ll^'ll ^ ll""!! ^ ll^ll ~ 1^1? ^iid so we have Pr(||?/|| ^ Clog^) ^ 
Pr(||x|| ^ C\ogl+ \\u\\) < (2m)-f(l^'l-^'°g^-l"l)/2 ^ (2m)^'l"l/2-9l^l/2. 

Second, suppose that the cutting was made in x, so that y = x"ux' with 

X = x'x" . 

Up to small words of length at most £/N at the beginning and end of 
X, the words x' and x" are products of randomly chosen geodesic words of 
length £/N. 

Apply Proposition[27lstarting with the element u, multiplying on the right 
by x', then on the left by x". This shows that Pr(||y|| ^ IImH + Ix'I + Ix"!— L) < 
(2m)~^-^/^, hence the evaluation, taking L = \x'\ + \x"\ + ||n|| — Clogi. 

To conclude, observe that there are at most (2m) I"' choices for u and at 
most |x| + |n| choices for the cyclic conjugate, hence an exponential factor 
in \u\. □ 
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5.1.3 The case of random reduced words 

Recall Tj is the cogrowth of the group G, i.e. the number of reduced words of 
length £ which are equal to e is roughly (2m — 1)''^. 

Here we have to suppose m > 1. (A random quotient of Z by reduced 
words of length £ is Z/£Z.) 

Proposition 30 - Axioms 1, 2, 3, 4' are satisfied by random uniformly 
chosen reduced words, or random uniformly chosen cyclically reduced words, 
with exponent 1 — r] (in base 2m — 1). 

The proof follows essentially the same lines as that for plain random 
words. We do not include it explicitly here. 

Nevertheless, there are two changes encountered. 

The first problem is that we do not have as much independence for reduced 
words as for plain words. Namely, the occurrence of a generator at position 
i prevents the occurrence of its inverse at position i + 1. 

We solve this problem by noting that, though the (i + l)-th letter depends 
on what happened before, the {i + 2)-th letter does not depend too much (if 
m > 1). 

Indeed, say the i-th letter is Xj. Now it is immediate to check that the 
{i + 2)-th letter is Xj with probability l/(2m — 1), and is each other letter 
with probability (2m — 2)/(2m — 1)^. This is close to a uniform distribution 
up to a factor of (2m — 2)/ (2m — 1) . 

This means that, conditioned by the word up to the i-th letter, the law 
of the word read after the {i + 2)-th letter is, up to a constant factor, an 
independently chosen random reduced word. 

This is enough to allow to prove satisfaction of the axioms for random 
reduced words by following the same lines as for plain random words. 

The second point to note is that a reduced word is not necessarily cycli- 
cally reduced. The end of a reduced word may collapse with the beginning. 
Collapsing along L letters has probability precisely (2m — and the in- 

duced length loss is 2L. So this introduces an exponent 1/2, but the cogrowth 
T) is bigger than 1/2 anyway. 

In particular, everything works equally fine with reduced and cyclically 
reduced words (the difference being non-local), with the same critical density 
1 — rj. 

5.2 Triviality of the quotient in large density 

Recall G is a hyperbolic group generated by 5 = . Let i? be a 

set of {2mY^ randomly chosen words of length We study G/ (R). 

As was said before, because triviality of the quotient involves small-scale 
phenomena, we have to work separately on plain random words, reduced 
random words or random geodesic words. 

Generally speaking, the triviality of the quotient reduces essentially to the 
following fact, which is analogue to the fact that two (say generic projective 
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complex algebraic) submanifolds whose sum of dimensions is bigger than the 
ambient dimension do intersect (cf. our discussion of the density model of 
random groups in the introduction). 

Basic intersection theory for random sets Let S be a set of 

N elements. Let a, [3 he two numbers in [0; 1] such that a + (3 > 1. Let A be 
a given part of S of cardinal A^". Let B be a set of randomly uniformly 
chosen elements of S. Then A n B with probability tending to 1 as 
N ^ oo (and the intersection is arbitrarily large with growing N). 

This is of course a variation on the probabilistic pigeon-hole principle 
where A = B. 

Remark - Nothing in what follows is specific to quotients of hyperbolic 
groups: for the triviality of a random quotient by too many relators, any 
group (with m > 1 in the reduced word model and g > in the geodesic 
word model) would do. 

5.2.1 The case of plain random words 

We suppose that d > 1 — 9. 

Recall that 9 is the gross cogrowth of the group, i.e. that 

9= lim \ log2„ ^B\w = em G} 

l^ao,t even t 

We want to show that the random quotient G/ {R) is either {1} or Z/2Z. 
Of course the case Z/2Z occurs when £ is even and when the presentation of 
G does not contain any odd-length relation. 

To use gross cogrowth, we have to distinguish according to parity of i. 
We will treat only the least simple case when £ is even. The other case is 
even simpler. 

Rely on the intersection theory for random sets stated above. Take for 
A the set of all words of length i — 2 which are equal to e in G. There are 
roughly (2m)^(^~^) ^ (2m)^^ of them. Take for B the set made of the random 
words of R with the last two letters removed, and recall that R consists of 
(2mY^ randomly chosen words with d > 1 — 9. 

Apply the intersection principle: very probably, these sets will intersect. 
This means that in R, there will probably be a word of the form wab such 
that w is trivial in G and a, b are letters in S or S~^. 

This means that in the quotient G/ {R), we have ab = e. 

Now as d + 9 > 1 this situation occurs arbitrarily many times as £ ^ oo. 
Due to our uniform choice of random words, the a and b above will exhaust 
all pairs of generators of S and S~^. 

Thus, in the quotient, the product of any two generators a,b E S U 
is equal to e. Hence the quotient is either trivial or Z/2Z (and is it trivial as 
soon as i is odd or the presentation of G contains odd-length relators). 

This proves the second part of Theorem HI 
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5.2.2 The case of random geodesic words 

When taking a random quotient by geodesic words of the same length, some 
local phenomena may occur. For example, the quotient of Z by any number 
of randomly chosen elements of norm i will be Z/£Z. Think of the occurrence 
of either {e} or Z/2Z in a quotient by randomly chosen non-geodesic words. 

In order to avoid this phenomenon, we consider a random quotient by 
randomly chosen elements of norm comprised between i — L and i + L for 
some fixed small L. Actually we will take L = 1. 

Recall g is the growth of the group, that is, the number of elements of 
norm i is roug hly (2m)f^, with g > SiS G is non-elementary. 

We now prove that a random quotient of any group G by {2mY^ randomly 
chosen elements of norm £ — 1, £ and £ + 1, with d > g/2, is trivial with 
probability tending to 1 as £ ^ cxd. 

(By taking {2mY^ elements of norm £ + 1 or £ — 1 we mean either 
taking {2mY^ elements of each of these norms, or taking 1/3 at each length, 
or deciding for each element with a given positive probability what its norm 
will be, or any other roughly equivalent scheme.) 

Let a be any of the generators of the group. Let x be any element of 
norm The product xa is either of norm i,i + lovi — 1. 

Let S be the sphere of radius £, we have \ S\ ~ (2m)^^. 

Let R be the set of random words taken. Taking d > g/2 precisely 
amounts to taking more than \S\^^'^ elements of S. 

Let R' be the image of i? by a; ^— > xa. By an easy variation on the 
probabilistic pigeon-hole principle applied to R, there will very probably be 
one element of R lying in R' . This means that R will contain elements x and 
y such that xa = y. Hence, a = e in the quotient by R. 

As this will occur for any generator, the quotient is trivial. This proves 
the second part of Theorem |H1 

5.2.3 The case of random reduced words 

For a quotient by random reduced words in density d > 1 — r] (where rj is 
the cogrowth of the group), the proof of triviality is nearly identical to the 
case of a quotient by plain random words, except that in order to have the 
number of words taken go to infinity, we have to suppose that m ^ 2. 

5.3 Elimination of the virtual centre 

Theorem El only applies to random quotients of hyperbolic groups with 
strongly harmless torsion. We have to show that the presence of a virtual 
centre does not change random quotients. The way to do this is simply to 
quotient by the virtual centre; but, for example, geodesic words in the quo- 
tient are not geodesic words in the original group, and moreover, the growth. 
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cogrowth and gross cogrowth may be different. Thus something should be 
said. 

Recall the virtual centre of a hyperbolic group is the set of elements 
whose action on the boundary at infinity is trivial. It is a normal subgroup 
(as it is defined as the kernel of some action). It is finite, as any element 
of the virtual centre has force 1 at each point of the boundary, and in a 
(non-elementary) hyperbolic group, the number of elements having force less 
than a given constant at some point is finite (cf. |GH| . p. 155). See |01s2| 
or |Ch3| for an exposition of basic properties and to get an idea of the kind 
of problems arising because of the virtual centre. 

Let H be the virtual centre of G and set G' = G/H. The quotient G' has 
no virtual centre. 

5.3.1 The case of plain or reduced random words 

Note that the set R is the same, since the notion of plain random word or 
random reduced word is defined independently of G or G'. 

As {G/H)/{R) = {G/{R))/H, and as a quotient by a finite normal sub- 
group is a quasi-isometry, G/ {R) will be infinite hyperbolic if and only if 
G'/{R) is. 

So in order to prove that we can assume a trivial virtual centre, it 
is enough to check that G and G/H have the same cogrowth and gross 
cogrowth, so that the notion of a random quotient is really the same. 

We prove it for plain random words, as the case of reduced words is 
identical with 9 replaced with r] and 2m replaced with 2m — 1. 

Proposition 31 - Let H be a subset of G, and n an integer. Then 

Pr(3M eG,\u\= n,BeU E H) ^ (2m)"Pr(5^+„ E H) 



Proof - Let iJ„ be the n-neighborhood oiH in G. We have that Pr(i?^+„ E 
H) ^ (2m)-"Pr(fi^ E H,,). □ 

Corollary 32 A quotient of a group by a finite normal subgroup has 
the same gross cogrowth. 

Proof - Let H he a finite subgroup of G and let n = max{\\h\\ , h E H} 
so that H is included in the ra-neighborhood of e. Then Pr(i?£ =g/h e) = 
FriBe E H) ^ E,^„(2m)'= Pr(5,+fe = e) < (2m)-(i-^)^ □ 

Remark - Gross cogrowth is the same only if defined with respect to the 
same set of generators. For example, F2 x Z/2Z presented by a,b,c with 
ac = ca, be = cb and = e has the same gross cogrowth as F2 presented by 
a, b, c with c = e. 

So in this case, we can safely assume that the virtual centre of G is trivial. 
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5.3.2 The case of random geodesic words 

A quotient by a finite normal subgroup preserves growth, so G and G' have 
the same growth. 

But now a problem arises, as the notion of a random element of norm £ 
difi^ers in G and G' . So our random set R is not defined the same way for G 
and G' . 

Let us study the image of the uniform measure on the ^-sphere of G into 
G' . Let L be the maximal norm of an element in H . The image of this sphere 
is contained in the spheres of radius between I — L and i + L. 

The map G — > G' is of index \H\. This proves that the image of the 
uniform probability measure fig on the sphere of radius iinG is, as a measure, 
at most I if I times the sum of the uniform probability measures on the spheres 
of G' of radius between i — L and i + L. In other words, it is roughly less 
than the uniform probability measure ug on these spheres. 

The uniform measure ui on the spheres of radius between i — L and i + L 
(for a fixed L) satisfies our axioms. So we can apply Theorem [THl to the 
quotient of G' by a set R' of random words chosen using measure u^. This 
random quotient will be non-elementary hyperbolic for d < g/2. 

By Remark El for a random set R picked from measure jj,£ (the one we 
are interested in), the quotient G'/ (R) will be non-elementary hyperbolic as 
well. 

But G'/{R) = G/H/{R) = G/{R)/H, and quotienting G/{R) by the 
finite normal subgroup is a quasi-isometry, so G/ {R) is non-elementary 
hyperbolic if and only if G'/ (R) is. 

6 Proof of the main theorem 

We now proceed to the proof of Theorem fT3l 

G is a hyperbolic group without virtual centre generated by S* = af^, . . ., 
. Say that G has presentation {S \ Q). Let i? be a set of random words 

of density at most d picked under the measure We will study G/{R). 
Let P = mm{P2, Ps, Pi) where P2) Pi, Pi are given by the axioms. We 

assume that d < p. 

We will study van Kampen diagrams in the group G/{R). If G is pre- 
sented hy {S \ Q), call old relator an element of Q and new relator an element 
of R. 

We want to show that van Kampen diagrams of G/ (R) satisfy a linear 
isoperimetric inequality. Let D be such a diagram. D is made of old and 
new relators. Denote by D' the subdiagram of D made of old relators and 
by D" the subdiagram of D made of new relators. 

If /3 = there is nothing to prove. Hence we suppose that P > 0. In the 
examples we consider, this is equivalent to G being non-elementary. 
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6.1 On the lengths of the relators 

In order not to make the already complex notations even heavier, we will 
suppose that all the words taken from fii are of length i. So R is made of 
{2rrif^ words of length i. This is the case in all the applications given in this 
text. 

For the general case, there are only three ways in which the length of the 
elements matters for the proof: 

1. As we are to apply asymptotic estimates, the length of the elements 
must tend to infinity. 

2. The hyperbolic local-global theorem of AppendixEl crucially needs that 
the ratio of the lengths of relators be bounded independently of i. 

3. In order not to perturb our probability estimates, the number of distinct 
lengths of the relators in R must be subexponential in i. 

All these properties are guaranteed by Axiom 1. 

6.2 Combinatorics of van Kampen diagrams of the quo- 
tient 

We now proceed to the application of the program outlined in section 01 The 
reader may want to refer to this section while reading the sequel of this text. 

We consider a van Kampen diagram D of Gj {R). Let D' be the part of 
D made of old relators of the presentation of G, and D" the part made of 
new relators in R. 

Redefine D' by adding to it all edges of D": this amounts to adding some 
filaments to D'. This way, we ensure that faces of D" are isolated and that 
D' is connected; and that if a face of D" lies on the boundary of D, we have 
a filament in D', such that D" does not intersect the boundary of D; and 
last, that if the diagram D" is not regular (see section [T] for definition), we 
have a corresponding filament in D' . 




After this manipulation, we consider that each edge of D" is in contact 
only with an edge of D', so that we never have to deal with equalities between 
subwords of two new relators (we will treat them as two equalities to the same 
word) . 
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We want to show that if D is minimal, then it satisfies some isoperimetric 
inequality. In fact, as in the case of random quotients of a free group, we do 
not really need that D is minimal. We need that D is reduced in a slightly 
stronger sense than previously, which we define now. 

Definition 33 - A van Kampen diagram D = D' U D" on G/{R) (with 
D' and D" as above) is said to be strongly reduced with respect to G if there 
is no pair of faces of D" bearing the same relator with opposite orientations, 
such that their marked starting points are joined in D' by a simple path 
representing the trivial element in G. 

In particular, a strongly reduced diagram is reduced. 

Proposition 34 - Every van Kampen diagram has a strong reduction, 
that is, there exists a strongly reduced diagram with the same boundary. 

In particular, to ensure hyperbolicity of a group it is enough to prove the 
isoperimetric inequality for all strongly reduced diagrams. 
Proof - Suppose that some new relator r of D" is joined to some by a 
path w in D' representing the trivial element in G. Then incise the diagram 
along w and apply surgery to cancel r with r^^. This leaves a new diagram 
with two holes w,w~^. Simply fill up these two holes with diagrams in G 
bordered by w (this is possible precisely since w is the trivial element of G). 




Note that this way we introduce only old relators and no new ones in the 
diagram. Iterate the process to get rid of all annoying pairs of new relators. 

□ 

Often in geometric group theory, problems arise when two relators are 
conjugate (or when a conjugate of a relator is very close to another relator), 
and such cases are typically excluded by reinforcing the definition of "strongly 
reduced". In the case of random presentations, however, below the critical 
density it never occurs that two relators are conjugated. So we do not have 
to care about these problems: these cases are automatically wiped off by our 
axioms. 

We will show that any strongly reduced van Kampen diagram D such that 
D' is minimal very probably satisfies some linear isoperimetric inequality. By 
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the local-global principle for hyperbolic spaces (Cartan-Hadamard-Gromov 
theorem, cf. Appendix EI), it is enough to show it for diagrams having less 
than some fixed number of faces. This crucial point considerably simplifies 
the geometric and probabilistic treatment. More precisely, we will show the 
following. 

Proposition 35 - There exist constants a,a' > (depending on G, d 
and the random model but not on i) such that, for any integer K, with 
probability exponentially close to 1 as i ^ oo the set of relators R satisfies 
the following: 

For any van Kampen diagram D = D'UD" satisfying the three conditions: 

• The number of faces of D" is at most K; 

• D' is minimal among van Kampen diagrams in G with the same bound- 
ary; 

• D is strongly reduced with respect to G; 
then D satisfies the isoperimetric inequality 

\dD\ ^ al\D"\+a'\D'\ 



(Of course, the constant implied in "exponentially close" depends on K.) 

Before proceeding to the proof of this proposition, let us see how it implies 
hyperbolicity of the group G/ {R), as well as that of all intermediate quotients. 
This step uses the local-global hyperbolic principle (Appendix EJ, which 
essentially states that it is enough to check the isoperimetric inequality for a 
finite number of diagrams. 

Proposition 36 - There exists an integer K (depending on G and d but 
not on i) such that if the set of relators R happens to satisfy the conclusions 
of Proposition 1^ with £ large enough, then G/{R) is hyperbolic. Better, 
then there exist constants ai,a2 > such that for any strongly reduced 
diagram D such that D' is minimal, we have 

\dD\ ^ ail\D"\+a2\D'\ 



Remark 37 - Proposition l36l implies that a quotient of G by a smaller set 
R' C Ris hyperbolic as well. Indeed, any strongly reduced diagram on R' is, 
in particular, a strongly reduced diagram on R. 

Proof - By our strongly reduction process, for any van Kampen diagram 
there exists another van Kampen diagram D with the same boundary, such 
that D' is minimal (otherwise replace it by a minimal diagram with the same 
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boundary) and D is strongly reduced. Thus, it is enough to show the isoperi- 
metric inequality for strongly reduced diagrams to ensure hyperbolicity. 

We want to apply Proposition [HHl Take for property P in this proposition 
"to be strongly reduced". Recall the notations of the appendix: Lc{D) = \dD\ 
is the boundary length of D, and Ac{D) is the area of D in the sense that a 
relator of length L has area L^. Note that i\D"\ + \D'\^ A^{D)/i. 

Take a van Kampen diagram D such that ^ A^^D) ^ ASOk"^ for 

some k"^ = Ki"^ where K is some constant independent of i to be chosen 
later. As Ad{D) ^ Kf, we have \D"\ ^ K. Proposition EHl for this K tells 
us that Lc{D) = \dD\^ ai\D"\ + a' \D'\ ^ min(a, a')A^{D)/i. Thus 

Lc{Dy ^ mm{a,a'yAc{D)yf ^ mm{a,a')^Ac{D)K/A 

as Ac{D) ^ so taking K = 10^^/ min(a,a')^ is enough to ensure that 

the conditions of Proposition EBl are fulfilled by Ki'^. (The important point 
is that this K is independent of i.) 

The conclusion is that any strongly reduced van Kampen diagram D 
satisfies the linear isoperimetric inequality 

L^{D) ^ A^{D) mm{a, a')/ 10^^ i 

and, fiddling with the constants and using the isoperimetry from D, we can 
even put it in the form 

\dD\ ^ ai£\D"\+a2\D'\ 

if it pleases, where ai,2 depend on G and d but not on i. 

So the proposition above, combined with the local-global hyperbolicity 
principle of Appendix El is sufficient to ensure hyperbolicity. □ 

A glance through the proof can even show that if i is taken large enough, 
the constant a2 in the inequality 

\dD\ ^ aii\D"\+a2\D'\ 

is arbitrarily close to the original isoperimetry constant in G. 

This suggests, in the spirit of |Gro4j . to iterate the operation of taking a 
random quotient, at different lengths £i, then £2, etc., with fast growing £j. 
The limit group will not be hyperbolic (it will be infinitely presented), but 
it will satisfy an isoperimetric inequality like 

\dD\;,a Yl W 

/ face of D 

where £(/) denotes the length of a face. This property could be taken as a 
definition of a kind of loose hyperbolicity, which should be related in some 
way to the notion of "fractal hyperbolicity" proposed in |Gro4| . 
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Now for the proof of Proposition ESI 

We have to assume that D' is minimal, otherwise we know nothing about 
its isoperimetry in G. But as in the case of a random quotient of (sec- 
tion [2l), the isoperimetric inequality will not only be valid for minimal dia- 
grams but for all (strongly reduced) configurations of the random relators. 

If D" = then D = D' is a van Kampen diagram of G and as D' is 
minimal, it satifies the inequality \dD\ ^ C \D\ as this is the isoperimetric 
inequality in G. So we can take a' = G and any a in this case. Similarly, if the 
old relators are much more numerous that the new ones, then isoperimetry 
of G is enough. Namely: 

Lemma 38 - Proposition\^holds for diagrams satisfying \D'\ ^ 4 \ D"\ i/G. 



Proof of the lemma - Suppose that the old relators are much more 
numerous than the new ones, more precisely that \D'\ ^ 4:\D"\i/G. In 
this well, isoperimetry in G is enough to ensure isoperimetry of D. 

Note that D' is a diagram with at most \D"\ holes. We have of course that 
\dD\ ^ \dD'\ - \dD"\ ^ \dD'\ - \D"\i. 

By Proposition inn for diagrams with holes in G, we have that \dD'\ ^ 
C\D'\- \D"\\{2 + 4:a\og\D'\). So, for £ big enough, 

\dD\ ^ \dD'\-\D"\i 

^ G\D'\- \ D"\i - \ D"\ X{2 + 4a\og\D'\) 
^ G\D'\/3 + {G\D'\/3-\D"\i) 

+ {G \D'\ /3 - \D"\X{2 + 4alog\D'\)) 
^ G\D'\/3 + {A\D"\i/3-\D"\i) 

+ {4\D"\£/3-\D"\X{2 + 4a log 4 I I i/G)) 
^ G\D'\/3 + i\D"\/3 

as for i big enough, the third term is positive. So in this case we can take 
a = 1/3 and a' = G/3. □ 

So we now suppose that 1 ^ \D"\ ^ K and that \D'\ ^ 4:\D"\i/G. In 
particular, the boundary length of D is at most \D"\ i + \D'\ X ^ i \D"\ (1 + 
AX/G). 



6.3 New decorated abstract van Kampen diagrams 

We now redefine decorated abstract van Kampen diagrams so that they bet- 
ter fit our needs (the definition given in the introduction fits the case of free 
groups only). The idea is that since D' is very narrow (at the scale of i), at 
scale £ D looks like a van Kampen diagram with respect to the new relators, 
with some narrow "glue" (that is, old relators) between faces. This intuition 
will be formalized using Proposition I72l in Appendix IB| which will help tell 
which parts of the boundary words of the new relators are facing which. 
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The diagram D' has at most K holes. First, after Corollary inHl we can 
suppose that D' is Ei log ^-narrow for some constant Ei depending on G and 
K but not on I (here we used |-D'| ^ AKi/C to get logarithmic dependence 

on e). 

Besides, we can apply Proposition E21 to D'. This defines a {8K, E2\ogi)- 
matching X (see Definition I7n|l between at most 8/*^ subwords of the bound- 
ary words of D', for some constant E2 depending on G and K but not on i 
(here again we used |-D'| ^ AKi/C to get logarithmic dependence on £). Set 
E = max{Ei,E2). 

The boundary words of D' are precisely the new relators on one side, and 
the boundary word of D on the other side. 




Each match in X is a pair of two subwords w, w' of one of the new relators 
(or of the boundary word), together with two short words u, v of length at 
most E\og£, such that w = uw'v in G. 



As there can be "invaginations" of D' into D", the lengths of w and w' 
may not be equal at all. It may even be the case that one of these two words 
is of length 0, as in the following picture. This is not overmuch disturbing 
but should be kept in mind. 



Intuitively, we can reconstruct D "at scale f if we know this matching 
X: simply take the new relators and glue them along the matches in X. 
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This leads to redefining what a decorated abstract van Kampen diagram is. 
Knowing D, the associated abstract diagram V will keep the combinatorial 
and geometric information but will forget what are the precise values of the 
new relators. Namely, given D we only keep the following information: How 
many new faces there are (that is, \D"\, which is at most K); Which new 
faces bear the same new relator or not (this can be done by attributing a 
number between 1 and \ D"\ to each face, two faces getting the same number 
if and only if they bear the same new relator) ; Which subword is matched 
to which one (that is, where the cuttings of the subwords were done and 
what the pairing is). This leads to the following (compare the definition of 
a davKd given in the introduction page [121 together with Proposition 17^ . 

Definition 39 - A decorated abstract van Kampen diagram (davKd for 
short) V is the following data: 

• An integer k (the number of faces), also denoted \V\; any number 
between 1 and k will be called a face ofV. 

• An integer \dV\ between and \V\i{l + 4A/C), called the boundary 
length ofV. 

• A set ofk integers between 1 and k (which faces bear the same relator). 

• For each face, a number between 1 and i (a starting point for the 
relator) and an orientation ±1. 

• A partition of the set {1, ...,£} x {1, ... , A;}U{1, ...,\dV\}x {k+1} into 
8k subsets (some of which may be empty) of the form {i,i + 1, . . . ,i + 
j} X {p}. The subsets {1, ...,£} x {p} will be called words in V, with 
{1, . . . , |(9P|} X {k+1} being the boundary word and the others internal 
words. The elements of the partition will be called subwords in V, and 
the length of a subword {i,i + 1, . . . ,i + j} x {p} will be j + 1. 

• A partition of the set of subwords into two parts and a bijection between 
these parts (which subword is matched to which). A pair of two bijected 
subwords will be called a match in V. 

A very important fact is the following one. 

Proposition 40 - For a fixed K, the number of different davKd's with 
at most K faces is less than some polynomial in i. 

Proof - This is at most K.Ki(\ + ^\IC).K^ .i^ .2^ .((K +V)if^ .(%Kf^ . 
□ 

We will still add some decoration below in section 16.81 This further 
decoration will again be polynomial in i. 

We just saw that to our van Kampen diagram D we can associate a 
decorated abstract van Kampen diagram P, coming from Proposition E2l 
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This we will call the davKd associated to D. This davKd sums up all quasi- 
equalities in G imposed by the diagram on the new relators. 

Definition 41 - Let V be a davKd. We say that a van Kampen diagram 
D = D'U D" ofG/{R) fulfills V ifV is the davKd associated to D and if D 
satisfies tfie assumptions of Proposition [131 and Lemma [23 tfiat is; 

• The number of faces of D" is at most K; 

• D' is minimal among van Kampen diagrams in G with the same bound- 
ary; 

• D is strongly reduced with respect to G; 

• \D'\ ^A\D"\i/G. 

A davKd V is said to be fulfillable if some van Kampen diagram fulfills 

it. 

A davKd V is said to satisfy an a-isoperimetric inequality if 

\dV\ ^ ai\V\ 

Proposition 42 - If some davKd V satisfies an a-isoperimetric inequal- 
ity, then any van Kampen diagram D = D' U D" fulfilling V satisfies the 
isoperimetric inequality 

\dD\ ^ a£\D"\/2 + Ga\D'\/8 

Proof - Indeed, since \D'\ ^ A\D"\e/G we have ai\D"\ /2 + Ga\D'\ /8 ^ 
ai\D"\. □ 

Thus, to prove PropositioninHlwe have to show that, with high probability, 
any fulfillable davKd satisfies some linear isoperimetric inequality (with some 
isoperimetric constants depending on G, the density d and the random model 
but not on K or i). 

In the matching X of D, there may be matches between subwords of 
the boundary, as in the following figure. Such parts of the diagram always 
improve isoperimetry (up to 2£'log^). So in the following we consider that 
all matches in X match either two subwords of the new relators or a subword 
of a new relator and a subword of the boundary. 




6.4 Graph associated to a decorated abstract van Kam- 
pen diagram 

As in the case of random quotients of the free group, we will construct an 
auxiliary graph T summarizing all conditions imposed by a davKd on the 
random relators of R. But instead of imposing equality between letters of 
these relators, the conditions will rather be interpreted as equality modulo 
G. 

Let now -D be a davKd. We will evaluate the probability that it is fulfil- 
lable by the relators of R. 

Each face of D bears a number between 1 and \D\. Let n be the number 
of such distinct numbers, we have n ^ \D\. Suppose for the sake of simplicity 
that these n distinct numbers are 1,2, ... ,n. 

To fulfill the diagram is to give n relators ri, . . . , r„ satisfying the condi- 
tions that if we put these relators in the corresponding faces, then by gluing 
"up to small words" the faces along the subwords desribed in the davKd, we 
get a (strongly reduced) van Kampen diagram of G/{R). 

We now construct the auxiliary graph F. 

Take ni points as vertices of F, arranged in n parts of i vertices called 
the parts of F. Interpret the k-th vertex of the i-th part as the k-th letter 
of relator rj in R. Internal subwords in D are identified with successive 
vertices of F (with a reversal if the face in D to which the subword belongs 
is negatively oriented). 

We now explain what to take as edges of F. 

Let / be a match in D. First, suppose that this is a match between two 
internal subwords in D. Say it is a match between subwords of faces of D 
bearing numbers i and i'. These two subwords in D correspond to two sets 
of successive vertices in the i-th part and the i'-th part of F. 

Add to F a special vertex w called an internal translator. Add edges 
between w and each of the vertices of the i-th part of F represented by the 
first subword of /; symmetrically, add edges between w and each of the 
vertices of the i'-th part of F belonging to the other subword of /. 

(This may result in double edges ifi = i'. We will deal with this problem 
later, but for the moment we keep the double edges.) 

Follow this process for all matches between internal subwords of D. Each 
translator so obtained is connected with two (or maybe one if i = i') parts 
of F. 

As several faces of D may bear the same number (the same relator of 
R), a vertex of F is not necessarily of multiplicity one. The multiplicity of a 
vertex of the i-th part is at most the number of times relator i appears on a 
face of D. 

For each match in D involving a boundary subword and an internal sub- 
word adjacent to, say, face i, add a special vertex b to F, called a boundary 
translator. Add edges between b and the vertices of the i-th part of F corre- 
sponding to the internal subword of the match at play. 
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At the end of the process, the number of edges in T is equal to the 
cumulated length of all internal subwords of D, which is i\D\. 

Here is an example of a simple van Kampen diagram on G/{R), its as- 
sociated davKd (represented graphically by a diagram "at scale f ), and the 
associated graph T. 




In a davKd associated to some van Kampen diagram with at most K 
faces, as we only consider, since the number of matches is at most AK, the 
number of internal and boundary translators in T is at most as well. 

Note that each translator corresponds, via Proposition 17^ to a word in 
the van Kampen diagram which is equal to e in G: translators are nothing 
else but matches of the davKd. Indeed, fulfillability of the davKd implies 
that for each (say internal) translator in F, we can find a word w which is 
equal to e in G, and such that w = W1UW2V where u and v are short (of length 
at most E\og£) and that wi and W2 are the subwords of the relators of R 
to which the translator is joined. In the case of random quotients of Fm, we 
had the relators of R directly connected to each other, imposing equality of 
the corresponding subwords; here this equality happens modulo translators 
that are equal to e in G. 

6.5 Elimination of doublets 

A doublet is a vertex of T that is joined to some translator by a double edge. 
This can occur only if in the van Kampen diagram, two nearly adjacent faces 
bear the same relator. 

Doublets are annoying since the two sides of the translator are not chosen 
independently, whereas our argument requires some degree of independence. 
We will split the corresponding translators to control the occurrences of such 
a situation. 

This section is only technical. 

Consider a translator in the van Kampen diagram bordered by two faces 
bearing the same relator r. As a first case, suppose that these two relators 
are given the same orientation. 

Let w be the translator, w writes w = u6iu'62 where u and u' are subwords 
of r, and (5i 2 are words of length at most 2Elogi. The action takes place 
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in G. As u and u' need not be geodesic, they do not necessarily have the 
same length. Let Ui be the maximum common subword of u and u' (i.e. their 
intersection as subwords of r). If ui is empty there is no doublet. 

There are two cases (up to exchanging u and u'): either u = U2UiU^ and 
u' = ui, OT u = U2U1 and u' = U1U3. 




We will only treat the first the second one is similar. 

Redefine ui, U2 and to be geodesic words equal to ui, U2 and M3 respec- 
tively. In any hyperbolic space, any point on a geodesic joining the two ends 
of a curve of length L is (1 + 51ogL)-close to that curve (cf. [BHij . p. 400). 
So the new geodesic words are (1 + 5 log £) -close to the previous words Ui, 
M2, M3. Hence, up to increasing E a little bit, we can still suppose that D is 
fulfillable such that D' is log ^-narrow, and that mi, M2, are geodesic. 

Define points A, A', B, B', C, D as in the figure. The word read while 
going from A' to B' is the same as that from D to C . 

By elementary hyperbolic geometry, and given that the two lateral sides 
are of length at most log£, any point on CD is {25 + log -close to 
some point on AA' or B'B, or 25-close to some point on A'B' . 

The idea is to run from D to C, and simultaneously from A' to B' at 
the same speed. When the two trajectories get -Elog^-close to each other, 
we cut the translator at this position, and by construction the resulting two 
parts do not contain any doublets. 

Let L = I Ml I and for ^ ? ^ L, let Ci be the point of DC at distance 
i from D. Now assign to i a number between and L as follows: Ci 
is close to some point C[ of AB, set ^p{Ci) = if C- G AA' , ip{Ci) = L if 
C[ E B'B, and ^{C,) = dist(C;, A') if Ci G A'B'. 

By elementary hyperbolic geometry (approximation of A'B' DC by a 
tree), the function ip : [0;L] — > [0]L] is decreasing up to 86 (that is, i < j 
implies ip{i) > ip{j) — S6). We have ip{0) = L and (p{L) = (up to 86). Set 
io as the smallest i such that ip{i) < i. This defines a point Cig on DC and 
a point C-|j on AB. 

There are six cases depending on whether C'^^ and Cj'^_^ belong to AA', 
A'B' or B'B. In each of these cases we can cut the diagram in at most three 
parts, in such a way that no part contains two copies of some subword of ui 
(except perhaps up to small words of length at most 86 at the extremities). 
The cuts to make are from to C'^^ and/or to C*-^-!, and are illustrated 
below in each case. 
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A translator is a vertex of T and by "cutting a translator" we mean that 
we split this vertex into two, and share the edges according to the figure. 

As our second (and more difficult) case, suppose that the translator is 
bordered by two faces of the diagram bearing the same relator r of R with 
opposite orientations. This means that the translator w is equal, in G, to 
u6iu'~^62 where u and u' are subwords of the relator r, and where 5i 2 are 
words of length at most 2E\ogi. 

As above, let Ui be the maximum common subword of u and u' (i.e. their 
intersection as subwords of r). There are two cases: u = U2U1U3 and u' = ui, 
or u = U2U1 and u' = Uiu^. 




We will only treat the first the second is similar. 

As above, redefine ui,U2 and ^3 to be geodesic. 

Define points A, A', B, B', C, D as in the figure. The word read while 
going from A' to B' is the same as that from C io D. 

By elementary hyperbolic geometry, and given that the two lateral sides 
are of length at most 2£'log£, any point on CD is {25 + 2_E' log -close to 
some point on AA' or B' B^ or 25-close to some point on A' B' . 

If any point on CD is close to a point on either AA' or BB' ^ we can 
simply eliminate the doublets by cutting the figure at the last point oi CD 
which is close to AA' . (As above, by cutting the figure we mean that we split 
the vertex of V representing the translator into three new vertices and we 
share its edges according to the figure.) In this way, we obtain a new graph 
r with the considered doublets removed. 




D 
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Otherwise, let L = and for ^ z ^ L, let Ci be the point of CD at 
distance i from C. Now assign to z a number (p{i) between and L as follows: 
Ci is close to some point C'^ of AB, set (p{Ci) = ii C'^ e AA', (p{Ci) = L ii 
CI e B'B, and ^{C,) = dist(Q', A') if ^ G A'B'. 

It follows from elementary hyperbolic geometry (approximation of the 
quadrilateral CA'B'D by a tree) that ip : [0; L] — > [0; L] is an increasing 
function up to 86 (that is, i < j implies ip{i) < ip{j) + 86). Moreover, let 
i be the smallest such that ip{i) > and j the largest such that ip{j) < L. 
Then ip is, up to 85, an isometry of to [ip{i); ip{j)] (this is clear on the 
approximation of CA'B'D by a tree). In other words: the word Ui is close 
to a copy of it with some shift (p{i) — i. 

Cut the figure in five: cut between Ci and C^', between Ci and a point of 
AA' close to it, between Cj and C'j and between Cj and a point of B'B close 
to it (such points exist by definition of i and j). 




This way, we get a figure in which only the middle part CiCjC'^C'^ of the 
figure contains two copies of a given piece of Mi. Indeed (from left to right in 
the figure) the first part contains letters to i of the lower copy of ui and no 
letter of the upper ui, the second part contains letters to ip{i) of the upper 
Ui and no letter of the lower Ui; the third part CiCjC'jC'i contains letters i 
to j of the lower Ui and letters (p{i) to ip{j) of the upper Ui; the fourth and 
fifth part each contain letters from only one copy of Ui. 

First suppose that the intersection of and [ip{i)] (p{j)] is empty, or 
that its size is smaller than ei \ ui\ (for some small ei to be fixed later on, 
depending on d and G but not on i). Then, in the new graph T defined by 
such cutting of the translator, at most ei \ ui\ of the doublets at play remain. 
Simply remove these remaining double edges from the graph T. 

In case the intersection of and [(p{i); fiJ)] is not smaller than ei \ ui\, 
let us now deal with the middle piece. 

Consider the subdiagram CiCjC'jC'i. it is bordered by two subwords u'i,u'l 
of Ui of non-empty intersection. The subword u'^ spans letters i to j of Mi, 
whereas u'l spans letters ip{i) to </?(j), with ip{j) — ip{i) = j — i up to 86. 

First suppose that the shift ip{i) —i is bigger than 62 \ ui\. Then, chop the 
figure into sections of size 62 \ ui\: 
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The word read on one side of a section is equal to the word read on the 
other side of the following section, but there are no more doublets. The 
original translator has been cut into at most l/£2 translators, the length of 
each of which is at least 62 \ ui\. 

Second (and last!), suppose that the shift (p{i) — z is smaller than £2 
This means that we have an equality Wivw2v~^ in G, where u is a subword of 
a random relator r, of length at least Si and with wi, W2 words of length 
at most £2 |wi|. 

As the diagram is strongly reduced, Wi and W2 are non-trivial in G. As 
the virtual centre of G has been supposed to be trivial, the probability of 
this situation is controlled by Axiom 4. Let this translator as is, but mark it 
(add some decoration to F) as being a commutation translator. Furthermore, 
remove from this translator all edges that are not double edges, that is, all 
edges not corresponding to letters of the v above (there are at most 2e2 \ ui\ 
of them). 

Follow this process for each translator having doublets. After this, some 
doublets have been removed, and some have been marked as being part of 
a commutation translator. Note that we suppressed some of the edges of F, 
but the proportion of suppressed edges is less than ei + 2e2 in each translator. 

6.6 Pause 

Let us sum up the work done so far. Remember the example on page l5fll 

Proposition 43 - For each strongly reduced van Kampen diagram D of 
the quotient G/{R) such that \D"\ < K and \D'\ ^ 4:\D"\i/C, we have 
constructed a graph F enjoying the following properties: 

• Vertices of F are of four types: ordinary vertices, internal translators, 
boundary translators, and commutation translators. 

• There are ni ordinary vertices of F, grouped in n so-called parts, of i 
vertices each, where n is the number of different relators of R that are 
present in D. Hence each ordinary vertex of F corresponds to some 
letter of a relator of R. 

• The edges of F are between translators and ordinary vertices. 

• The number of edges at any ordinary vertex is at most equal to the 
number of times the corresponding relator of R appears in D. 
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• For each internal translator t, the edges at t are consecutive vertices 
of one or two parts of F, representing subwords u and v of relators of 
R. And there exists a word w such that w = S1US2V and w = e in G, 
where (5i 2 have length at most 2E\ogi. 

• For each boundary translator b, the edges at b are consecutive vertices 
of one part of F, representing a subword u of some relator of R. For 
each such fe, there exists a word w such that w = 61U62V and w = e 
in G, where f is a subword of the boundary of D, and where 61^2 have 
length at most 2E\og£. 

• For each commutation translator c, the edges at c are double edges to 
successive vertices of one part of F, representing a subword u of some 
relator of R. And there exists a word w such that w = 6iu62U~^ and 
w = e in G, where 612 have length at most £2 

• There are no double edges except those at commutation translators. 

• There are at most 4:K/e2 translators. 

• The total number of edges of F is at least \D" \ i{l — ei — 262). 

The numbers K and 61,62 are arbitrary. The number E depends on G 
and K but not on £. 

Axioms 2, 3 and 4 are carefully designed to control the probability that, 
respectively, a boundary translator, internal translator, and commutation 
translator can be filled. 

Note that this graph depends only on the davKd associated to the van 
Kampen diagram (up to some dividing done for the elimination of doublets; 
say we add some decoration to the davKd indicating how this was done). 

Keep all these properties (and notations) in mind for the sequel. 

6.7 Apparent length 

The line of the main argument below is to fulfill the davKd by filling the 
translators one by one. 

As the same subword of a relator can be joined to a large number of 
different translators (if the relator appears several times in the diagram), 
during the construction, at some steps it may happen that one half of a 
given translator is filled, whereas another part is not. The solution is to 
remember in one way or another, for each half-filled translator, what is the 
probability that, given its already-filled side, the word on the other side will 
fulfill the translator. This leads to the notion of apparent length, which we 
define now. 

Say we are given an element x of the group, of norm We try to 

know if this is a subword of one of our random words under the probability 
measure and to determine the length of this subword. 
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Given Axiom 2, a good guess for the length of the subword would be 
||x|| 1^2, with the probability of a longer subword decreasing exponentially. 

Given Axiom 3, a good method would be to take another subword y of 
length \y\ Sui random under /x^, and test (taking u = v = e m Axiom 3) the 
probability that xy = 1. If x were a subword under /i^, this probability would 
be roughly (2m)~'^'^l^l+l^l'', hence an evaluation — ■|logPr(x?/ = e) — \y\ for 
the hypothetical length of the subword x. 

This leads to the notion of apparent length. 

We are to apply Axiom 3 to translators, with u and v of size 2_E'log^. For 
fixed X G G, let L ^ and denote by pL^xuyv = e) the probability that, if y 
is a subword of length L under /i£ (in the sense of Definition Uni) there exist 
words u and v of length at most 2E\ogi such that xuyv = e. 

Definition 44 (Apparent length at a test-length) The 

apparent length of x at test-length L is 



By definition, if we have a rough evaluation of pi, we get an evaluation 
of Li up to o{i) terms. 

We are to apply this definition for y a not too small subword. That is, we 
will have s^I/ki ^ \y\ ^ ki£ with ki as in Axiom 1, for some to be fixed 
soon. We will also use the evaluation from Axiom 2. 

Definition 45 (Apparent length) - The apparent length of x is 



uniformly in i' . 

As usual, in this proposition the sense of "for any subword under /x^" is 
that of Definition Uni 

Proof - This is simply a rewriting of Axioms 2 and 3, combined to the 
observation that the choice of the test-length and of the small words u and 
V (which are of length O(log^)) only introduces a polynomial factor in i. □ 

In our proof, we will also need the fact that the number of possible ap- 
parent lengths for subwords under /i£ grows subexponentially with £. So we 
need at least a rough upper bound on the apparent length. 



= — — logpL^xuyv = e) — L 




Our main tool will now be the following 
Proposition 46 - For any subword x under /i^, we have 



Pr(L(x) ^\x\-i')< (2m) 
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By definition, if x appears with probability p as a subword under /x^, 
then by symmetry y will by equal to with the same probability, and 
thus the probability that xuyv = e is at least (taking u = v = e). Thus 
L]3.](x) ^ — |logp — |a;|. Reversing the equation, this means that for any 

subword x under we have that Pr(L(x) ^ L) ^ (2m)~^(^~l^l^/^. 

In particular, taking L large enough {L ^ A^ is enough) ensures that in 
a set of {2mY^ randomly chosen elements under /i^ with d < P, subwords 
of length greater than L only occur with probability exponentially small as 
i oo. Thus, we can safely assume that all subwords of words of R have 
apparent length at most 4i. 

In the applications given in this text to plain random words or random 
geodesic words, apparent length has more properties, especially a very nice 
behavior under multiplication by a random word. In the geodesic word 
model, apparent length is simply the usual length. We do not explicitly need 
these properties, though they are present in the inspiration of our arguments, 
and thus we do not state them. 

6.8 The main argument 

Now we enter the main step of the proof. We will consider a davKd and 
evaluate the probability that it is fulfillable. We will see that either the 
davKd satisfies some isoperimetric inequality, or this probability is very small 
(exponential in £). It will then be enough to sum on all davKd's with at most 
K faces to prove Proposition [33 

In our graph F, the ordinary vertices represent letters of random relators. 
Say r has ni ordinary vertices, that is, the faces of D" bear n different 
relators of R. 

We will use the term letter to denote one of these vertices. Enumerate 
letters in the obvious way from 1 to ni, beginning with the first letter of the 
first relator. So, a letter is a number between 1 and ni indicating a position 
in some relator. Relators are random words on elements of the generating 
set S of G, so if z is a letter let fi be the corresponding element of S. 

Since the relators are chosen at random, the /j's are random variables. 

As in the case of random quotients of the free group, the idea is to con- 
struct the graph T step by step, and evaluate the probability that at each 
step, the conditions imposed by the graph are satisfied by the random set 
R of relators. We will construct the graph by groups of successive letters 
joined to the same translators, and use the notion of apparent length (see 
Definition mj to keep track of the probabilities involved at each step. 

For a letter i, write z G t if z is joined to translator t. For 1 ^ a ^ n, 
write i e a to mean that letter i belongs to the a-th part of the graph. So 
is the product of the /j's for i E a. 

Consider an internal translator t. There is a word w associated to it, 
which writes w = u6iv62 where 61^2 are short and u and v are subwords 
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of the random relators. The subwords u and v are products of letters, say 
u = fp . . . fg and V = fr . ■ ■ fs- Reserve these notations w{t), u{t), v{t), pit), 
q{t), r{t) and s{t). Give similar definitions for boundary translators and 
commutation translators. 

Call u and v the sides of translator t. The translator precisely imposes 
that there exist short words 81,82 such that u8iv82 = e in G. We will work 
on the probabilities of these events. 

Some of the translators may have very small sides; yet we are to apply 
asymptotic relations (such as the definition of cogrowth) which ask for ar- 
bitrarily long words. As there are at most 4:K/e2 translators, with at most 
two sides each, the total length of the sides which are of length less than e^i 
does not exceed e3£.8K/e2- Setting = e\/SK ensures that the total length 
of these sides is less than £2^- 

Call zero-sided translator an internal translator both sides of which have 
length less than e^L Call two-sided translator an internal translator having 
at least one side of length at least e^l and its smaller side of length at least 
£3 times the length of its bigger side. Call one-sided translators the rest of 
internal translators. 

Throw away all zero-sided translators from the graph F. This throws 
away a total length of at most £2^, and do not call sides any more the small 
sides of one-sided translators. Now we have two-sided translators, one-sided 
translators, commutation translators and boundary translators, all sides of 
which have length at least So if £ is large enough (depending on £3) we 
can apply the probability evaluations of Axioms 1-4 without trouble. 

For a letter z, say that translator t is finished at time i if z ^ s{t). Say 
that two-sided translator t is half-finished at time i if q{t) ^ i < r{t). 

Add a further decoration to F (and to the davKd): for each two-sided 
translator t, specify an integer L{t) between and M (remember we can sup- 
pose that every subword has apparent length at most M). This will represent 
the apparent length of the half-word u{t) associated to the diagram when it 
is half-finished. In the same vein, specify an integer L{h) between and M for 
each boundary translator 6, which will represent the apparent length of the 
word u{h) when h is finished. We want to show that the boundary length is 
big, so we want to show that these apparent lengths of boundary translators 
are big. What we will show is the following: if the sum of the imposed L(&)'s 
for all boundary translators h is too small, the probability that the diagram 
is fulfillable is small. 

Now say that a random set of relators ri, . . . ,r„ fulfills the conditions 
of F up to letter i if for any internal or commutation translator t which is 
finished at time i, the corresponding word w{t) is trivial in G; and if, for any 
half-finished two-sided translator t, the apparent length of the half-word u{t) 
is L{t)] and if, for each finished boundary translator 6, the apparent length 
oiuih) is Lib). 

(An apparent length is not necessarily an integer; by prescribing the ap- 
parent length of u{t), we prescribe only the integer part. As £ is big the 
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discrepancy is totally negligible and we will not even write it in what fol- 
lows.) 

Of course, fulfillability of the davKd implies fulfillability of F up to the 
last letter for some choice of ri, . . . , r„ G i? and for some choice of the L(t)'s. 
(It is not exactly equivalent as we threw away some small proportion ci of 
the edges.) 

For a given relator r, there may be some translators having a side made 
of an initial and final piece of r, so that the side straddles the first letter of 
r. As we will fill in letters one by one starting with the first ones, we should 
treat these kind of translators in a different way. The simplest way to treat 
this little problem is a further cutting of the translators that straddle the 
beginning of a word, using Proposition ESI as is best explained by a figure 
(the thick dot represents the beginning of some relator). 




Up to now there are three free variables in our argument: K, the maximal 
number of new cells in diagrams we consider; and ei and 62, which are linked 
to the way we cut translators to eliminate doublets. 

Proposition 47 - For every density d < P, for every K, there exists 
€1,62 > such that, if i is large enough, then, for any davKd V, either V 
satisfies a ^{1 — d/ (3)-isoperimetric inequality (in the sense of Definition al]) , 
where ^2 is the constant in Axiom 2, or the probability that V is fulfillable 
is less than (2m) "^(^"^^/^ 

Before proceeding to the proof of this proposition, let us show how it 
implies Proposition ESI via Proposition |321 

If we know that the number of distinct davKd's associated to a van Kam- 
pen diagram satisfying the assumptions of Proposition IHHl is polynomial in 
i, then summing the probability evaluation of Proposition HZI on all such 
davKd's we can conclude: in this case, the probability that there exists a 
davKd violating the isoperimetric inequality is exponentially small, and so 
any van Kampen diagram will satisfy an isoperimetric inequality, since any 
van Kampen diagram satisfying the assumptions of Proposition [221 and |HH1 
has an associated davKd. So we will evaluate the number of davKd's with 
at most K faces. 

But by Proposition SOI the number of possible davKd's is polynomial in 
i at fixed K. We have to beware we added some extra decoration to the 
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davKd in between: in the elimination of doublets (we made at most K/e2 
more cuttings, which can be kept track of by as many numbers between 1 and 
Ki), and when prescribing an apparent length for each internal translator 
(at most 4:K/e2 numbers between 1 and M). So the number of possibilities 
remains polynomial in i (all other things being fixed). 

This proves that the probability that there exists a davKd violating the 
isoperimetric inequality decreases exponentially with i, hence Propositionl35l 

Proof of Proposition Choose some integer K. It is time to fix 

the parameters ei, £2- Recall we set £3 = 62/ 8K. Also recall that the sides 
of translators are of length at least £3^, so that we will take i large enough 
depending on 63 (that is, depending on K and on the axioms). 

With foresight, let e = ei + 3e2 + 74£2//5 + £3/1^2 where K2, 14,13 are the 
constants appearing in the axioms. Choose ei and 62 small enough so that 
e ^ (1 — d/ j3)/A. These choices depend on d and G but not on £ neither 
on any diagram. 

Let Pj be the probability that some fixed choice of n relators ri, . . . , G 
R under our law /i£ fulfills V up to letter i. This does not take into account 
the choice of n relators among the {2mY^ relators of the presentation. The 
quantity Pj depends only on the davKd V and on the law iii of the relators. 

Let 1 ^ a ^ n (recall n is the number of parts of the graph, or the number 
of different relators of R appearing in the diagram) . Let be the number 
of times relator a appears in the diagram. Let iq be the first letter of a, and 
if the last one. 

Let P"" be the probability that there exists a choice of relators ri, . . . ,ra 
in R fulfilling the conditions of P up to letter if (the last letter of a). As 
there are by definition (2m) '^^ choices for each relator, we have 

which expresses the fact that when we have fulfilled up to part a — 1, to 
fulfill up to part a is to choose the a-th relator in R and to see if the letters 
fif^, . . . , fi^ of this relator fulfill the conditions imposed on the a-th part of 
the graph by the translators. 

Let Aa be the sum of all L(t)'s for each two-sided translator t which is 
half-finished at time if, plus the sum of all L(6)'s for each boundary translator 
b which is finished at time if. We will study Aa — Aa-i. 

Lemma 48 - For any davKd V with at most K faces, for any 1 ^ a ^ n 
we have 

Aa - Aa., > ma -e) + ^""^^ ^"'^ ) + 0(1) (.) 

where the constant implied in o{i) depends on K but not on the diagram V. 
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Before proving this lemma, let us finish the proof of Proposition 1471 
Recall we saw above that 



where the (2m) '^^ factor accounts for the choice of the relator in R. 

Set da = log2m (compare the case of random quotients of Fm). Beware 
the daS are non-positive. From {-k) we get 

Aa - ^ (^iil -e) + d_azA_L^^ + o{l) 

Compare this to the equation linking dimension and number of edges on 
page [13 (and recall that here Aa is not the number of edges but the apparent 
length, which varies the opposite way, and that we want it to be big). 

Summing the inequalities above for a from 1 to n gives 
An ^ ^{l - e)^ma-'^^ma + ^^ma{da- da-l) + o{i) 



nia+i) + o{l) 



The number of summands is n ^ A', so that the constant in o{€) is 
controlled by K again. 

At the end of the process, all translators are finished, so by definition An 
is simply the sum of the apparent lengths of all boundary translators, that 
is An = Y.,L{h). 

Now recall that (if V is ever fulfillable) a boundary translator h means the 
existence of an equality e = 5iu52V in G, with by assumption L(m) = L{h), 
the 5's of length at most 2E log and v lying on the boundary of the diagram. 
By the definition of apparent length (Definition HHl which takes Axiom 2 into 
account), we have ^ K2L(m) = i^2L{h), thus \\v\\ ^ — ||5i|| — ||52|| ^ 
K,2L{b) + o{£). As V lies on the boundary of V this implies 

\dV\^ K2An + 0{i) 

(once again we can sum the o(£)'s harmlessly since the number of translators 
is bounded by some function of K.) 

So using the lower bound for An above we get 

\dV\ ^ |P|£(l-£-d//3)K2 + ^5];4(m,-m,+i) +o{i) 

Recall we managed to choose e ^ {1 — d/ P)/4:. Also take £ large enough 
so that the o{i) term is less than £{1 — d/l3)/A (such an I depends on K). 
The inequality above rewrites 

\dV\ ^ \V\l{l-d/l3)K2/2+'^Y.'^a{ma-ma+i) 
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We are free to choose the order of the enumeration of the parts of the 
graph. In particular, we can suppose that the m^'s are non-increasing. 

As '^rua = \T>\, we have J^^ai'^'a — "^a+i) ^ \^\ inf da (recall the daS 
are non-positive). Thus 

\dV\ ^ !^\V\i{f]-d + 2mida/e) 

By definition, the probability that the davKd is fulfillable is less than 
{2mY'' for all a. This probability is then less than (2m)™^''". 

First suppose that inf da ^ — — (i)/4. Then we have the isoperimetric 
inequality 

\dV\ ^'ji\V\ {l-d/p) 

as needed. 

Or, second, suppose inf da < —l{l3—d)/A. This means that the probability 
that the davKd is fulfillable is less than (2m) "^('^-'^^/^ 
This proves Proposition ITTI assuming Lemma HHl □ 

Proof of Lemma 1481 The principle of the argument is to look at 
the evolution of the apparent length of the translators, where the apparent 
length of a translator at some step is the apparent length of the part of 
this translator which is filled in at that step. We will show that our axioms 
imply that when we add a subword of some length, the probability that the 
increase in apparent length is less than the length of the subword added is 
exponentially small, such that a simple equation is satisfied: 

, , AlogP 

(where A denotes the difference between before and after filling the subword). 
This will be the motto of our forthcoming arguments. 

But at the end of the process, the word read on any internal translator 
is e, which is of apparent length 0, so that the only contribution to the total 
apparent length is that of the boundary translators, which we therefore get 
an evaluation of. 

Now for a rigorous exposition. The difference between Aa and Aa-i is 
due to internal translators which are half-finished at time iq and are finished 
at time if, to internal translators which are not begun at time zq and are 
half-finished at time if, and to boundary translators not begun at time io 
but finished at time if-, that is, all internal or boundary translators joined to 
a letter between io and if. 

First, consider a two-sided translator t which is not begun at time io and 
half-finished at time if. Let AfAa be the contribution of this translator to 
Aa — Aa-i, we have AtAa = L{t) by definition. Taking notations as above, 
we have an equality e = u5iv52 in G. By assumption, to fulfill the conditions 
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imposed by T we must have L(n) = L(t). The word u is a subword of the 
part a of r at play. But Proposition!^ fthat is, Axioms 2 and 3) tells us that, 
conditionally to whatever happened up to the choice of u, the probability that 
L(u) = L{t) is roughly less than (2m) "^('"'"-^^^^^ Thus, taking notations as 
above, with p the first letter of u and q the last one, we have 

P,/P,_i < (2m)-'3(l^l-^W) 

or, taking the log and decomposing u into letters: 

A,A^> 1 + ^"^^"^^"/^^"^^"^ +o{i) 

where 1 stands for the length of one letter (!). Note that a rough evaluation 
of the probabilities gives an evaluation up to o{£) of the apparent lengths. 
This is the rigorous form of our motto above. 

Second, consider an internal translator t which is half-finished at time io 
and finished at time if. Let AtAa be the contribution of this translator to 
Aa — Aa-i, we have AtA = —L{t). Taking notations as above, we have an 
equality e = u6iv62 in G. By assumption, we have L(u) = L(t). But the very 
definition of apparent length (Definition Hi]) tells us that given u, whatever 
happened before the choice of v, the probability that there exist such 5i 2 
such that e = uSivS2 is at most (2m)-'^('^(")+l"l). Thus 

P./P._i < (2m)-'^(^W+l''l) 

where r and s are the first and last letter making up v. Or, taking the log 
and decomposing v into letters: 

A,A^^ 1 + ^"^^-^--^"^^-^-^ +oi£) 

i&,i£a 

which is exactly the same as above. 

Third, consider an internal translator t which is not begun at time tQ and 
finished at time if, that is, t is joined to two subwords of the part a of the 
graph at play. As we removed doublets, the subwords u and v are disjoint, 
and thus we can work in two times and apply the two cases above, with first 
t going from not begun state to half-finished state, then to finished state. 
The contribution of t to Aa — Aa-i is 0, and summing the two cases above 
we get 

which is exactly the same as above. 

Fourth, consider a commutation translator t which is not begun at time 
and is finished at time if. Write as above that e = 5iu52U~^ in G, with 
5i and 82 of length at most £2 |^|- By Axiom 4, whatever happened before 
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the choice of u, this event has probabihty roughly less than (2m)''''"^2l"'~^l"' 
where 74 is some constant. Take £4 = '^4,62/^, and as usual denote by p and 
q the first and last letters making up u. We have shown that 

Take the log, multiply everything by two (since each letter joined to the 
commutation diagram t is joined to it by a double edge), so that 

A,Aa = ^ 5^ 2(1 - 54) + 2i^S2n^Pi^^g2rJVl ^ ^^^^ 

iet ^ 

Fifth, consider a one-sided translator not begun at time iq and finished 
at time if. We have an equality e = u6iv52 in G, where (5i2 have length 
O(log^) and \v\ ^ s^\u\ (by definition of a one-sided translator), so that 
^ £3 |m| + 0(log£). But by Axiom 2, this has probability roughly less 
than (2m)~'^''"'^^~^3/''2\ so once again, setting £5 = s^/k^. 

AtAa = ^ 2^ (1 - £5) H + o{i) 

Sixth (and last!), consider a boundary commutator t that is not begun 
at time zq and is finished at time if. Its situation is identical to that of an 
internal translator half-finished at time if (first case above), and we get 



We are now ready to conclude. Sum all the above inequalities for all 
translators joined to part a: 



t translator joined to a 



E (1-^5) + 



t non-commutation translator 



log2m Pj - lQg2m ^i-l 
log2m Pi - log2m P^-l 



+ ^2(l-£4) + 2 

t commutation translator 
+0(£) 

Recall nia is the number of times the a-th relator appears in the van Kam- 
pen diagram. The way we constructed the graph, any vertex representing a 
letter of the a-th relator is joined to translators (except for a proportion 
at most £1 + 352 that was removed), counting commutation translators twice. 
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Thus, in the sum above, each of the ^ letters of a appears exactly times, 
and so 

K - Aa-i ^ rua \ l{l-ei-e^-ei-?>e2) + 1 + o(£) 

(Because of the removal of a proportion at most Si + ?>e2 of the letters, 
some terms \og2m^if ~ loS2mPjo-i ^'^^ missing in the sum; but as for any i, 
we have ^ P^-i, the difference of log-probabilities log2m P? ~ log2m Pj-i is 
non-positive, and the inequality is true a fortiori when we add these missing 
terms.) 

Note that there is nothing bad hidden in the summation of the o{t) terms, 
since the number of terms in the sum is controlled by the combinatorics of 
the diagram (i.e. by K)^ and depends in no way on ^. 

Recall we set e = ei + ?>e2 + 74£^2//5 + s-i/ 1^2 = ^1 + 3^2 + £4 + £5, which is 
exactly what we get here. So Lemma SHI is proven. □ 

All pending proofs are finished; hence hyperbolicity of the random quo- 
tient when d < p. 

6.9 Non-elementarity of the quotient 

We now prove that if d < P, the quotient is infinite and not quasi-isometric 
to Z. 

6.9.1 Infiniteness 

Let d < p. We will show that the probability that the random quotient is 
finite decreases exponentially as £ — > 00. 

We know from hyperbolicity of the quotient (Proposition l36|l that the 
probability that there exists a van Kampen diagram of the quotient whose 
part made of old relators is reduced and which is strongly reduced with 
respect to G, violating some isoperimetric inequality, is exponentially close 
to 0. 

Imagine that G/ {R) is finite. Then any element of the quotient is a 
torsion element. Let x be an element of the quotient, this means that there 
exists a van Kampen diagram D bordered by for some n. 

Now take for x a random word picked under fi^. We will show that such a 
random word is very probably not a torsion element in the quotient. Instead 
of applying the previous section's results to the random quotient of G by R, 
consider the random quotient of G by i? U {x}. Since x is taken at random, 
RU{x} is a random set of words, whose density is only slightly bigger than d; 
this density is d' = j log2m ((2m)°'^ + l) which, if £ is large enough, is smaller 
than pad is. 

Now, if G/{R) is finite then x is of torsion. Set = = {2mY^ . Con- 
sider the following family of diagrams. Let D be any abstract van Kampen 
diagram of G/{R) of boundary length ni for some n. Define the spherical 
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diagram E by gluing n faces of boundary size i on the boundary of D along 
their border, and associate to each of the new faces the relator number + 1, 
so that D is an abstract van Kampen diagram with respect to i? U {x}. If 
G/ {R) is finite then x is of torsion, thus at least one of the diagrams E in 
this family is fulfillable with respect to i? U {x}. 




By Proposition ISH we can take the strong reduction of this diagram. It 
is non-empty as the faces bearing x cannot be cancelled (they all have the 
same orientation). 

So there exists a strongly reduced van Kampen diagram of G/ {R U {x}) 
with boundary length 0. 

But we know by what we already proved (Propositions IHKl and that, 
in the random quotient G/ {R U {x}) at density c/', the existence of such a 
diagram is of probability exponentially close to as £ tends to infinity. This 
ends the proof. 

6.9.2 Non-quasiZness 

We show here that the random quotients for d < /5 are not quasi-isometric 
to Z. Of course, we suppose /5 > 0, which amounts, in the case we consider 
(plain, or reduced, or geodesic words), to G itself not being quasi-isometric 
to Z. 

We will reason in a similar manner as above to prove infiniteness. We 
will consider a random quotient by a set R of words at density d, and we will 
add to R two random words x and y picked under thus obtaining a new 
random set of words at a density d' > d. As i is big, d' is only slightly above 
d, and if £ is big enough we still have d' < 13. 

Say (from Proposition IHEI) that any strongly reduced diagram D of the 
group G/{R') satisfies an isoperimetric inequality \dD\ ^ a£\D"\ for some 
positive a, notations as above. 

Suppose that G/ {R) is quasi-isometric to Z. 

The two random elements x and y are either torsion elements or each of 
them generates a subgroup of finite index. The case of torsion is handled 
exactly as above in the proof of infiniteness. 

Thus, suppose x is not a torsion element. Let h be the index of the 
subgroup it generates. Of course h depends on x. 

For any n G Z, we can find a p such that y"' = x'^u in G/{R), where u is 
of length at most h. This equality defines a van Kampen diagram of G/ (R). 

Now glue n faces containing y and p faces containing x to the boundary 
of this diagram. This defines a van Kampen diagram of G/{R'), which we 
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can take the strong reduction of. This reduction is non-empty since faces 
bearing x and y cannot be cancelled (so in particular \D"\ ^ n+p). The 
boundary of this diagram is u. 

But n can be taken arbitrarily large, so we can take n > \u\ /a. Then the 
diagram has at least n faces and boundary length \u\, which contradicts our 
isoperimetric inequality \dD\ ^ ai\D"\. 

Of course, m, n and p depend on the random words x and y. But consider 
the following family of diagrams: for each /i G N, each p G N and each n G N 
such that n > h/a, consider all abstract van Kampen diagrams of length 
h + ni + pi, with the numbers on the faces between 1 and N = \R\. Consider 
the diagrams obtained from these by the following process: glue p faces of 
size i bearing number + 1 on the boundary, and n faces of size £ bearing 
number N + 2. 

The reasoning above shows that iiG/{R) is quasi-isometric to Z, then at 
least one of these abstract van Kampen diagrams is fulfillable by a strongly 
reduced van Kampen diagram on the relators of R'. But all these diagrams 
violate the isoperimetric inequality, hence the conclusion. 

Alternate proof. We give an alternate proof as it uses an interesting prop- 
erty of the quotients. It works only in the case of a random quotient by 
uniformly chosen plain words. 

Proposition 49 - If d > 0, then the abelianized of a random quotient 
of any group by uniformly chosen plain random words is (with probability 
arbitrarily close to 1 as i ^ oo) either {e} or Z/2Z. 

(As usual, we find Z/2Z when i is even and there are no relations of odd 
length in the presentation of G.) 

Of course this is not necessarily true if d = 0, since in this case the number 
of relations added does not tend to infinity. 
Proof 

We want to show that a random quotient in density d > of the free 
abelian group Z"^ is trivial or Z/2Z. 

Take a random word of length i on af^,...a^. By the central limit 
theorem (or by an explicit computation on the multinomial distribution), 
the number of times generator appears is roughly i/2m up to 

For the sake of simplicity, say that £ is a multiple of 2m. The probability 
that a random word w is such that all relators Oj and aj^ appear exactly 
£/2m times in w is equivalent to 



(7r£/m)(2™-i)/2 

by the central limit theorem with 2m — 1 degrees of freedom or by a direct 
computation using Stirling's formula. 

This is equivalent as well to the probability that all and aj^ appear 
exactly £/2m times, except for some appearing 1 +i/2m times and some 
ttjg appearing i/2m — 1 times, this deviation being negligible. 
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This probability decreases polynomially in ^. But we choose an exponen- 
tial number of random words, namely {2mY^. So if d > 0, with very high 
probability we will choose a word w in which all Oj and appear exactly 
(./2m times, except for some appearing 1 + £/2m times and some a^g 
appearing (/2m — 1 times. 

But w = e'rn. the quotient, and w = e in an abelian group is equivalent to 
CLioCiJ^ = e since all other relators appear exactly the same number of times 
with exponent 1 or —1 and thus vanish. 

As this occurs arbitrarily many times, this will happen for all couples of 
So these relators satisfy Oj = af^ in the quotient for all In particular, 
all relators are equal and moreover we have = a~^. 

Thus the abelianized is either {e} or Z/2Z. □ 

Corollary 50 - A random quotient of a hyperbolic group by plain ran- 
dom words for d < [3 is not quasi-isometric to Z. 

Proof - First take d > 0. It is well-known (cf. |SW| . Theorem 5.12, p. 178) 
that a group which is quasi-isometric to Z has either Z or the infinite diedral 
group Doo as a quotient. 

If Z is a quotient of the group, then its abelianized is at least Z, which 
contradicts the previous proposition. If D^o is a quotient, note that the 
abelianized of D^o is D2 = Z/2Z x Z/2Z, which is still incompatible with the 
previous proposition. So we are done if d > 0. 

Now if c? = 0, note that a random quotient with d > is a quotient of a 
random quotient with d = (isolate the first relators) . If the random group 
at d = were quasi-isometric to Z, then all of its quotients would be either 
finite or quasi-isometric to Z, which is not the case. (Note that here we use 
hyperbolicity of G to be allowed to apply our main theorem, implying that 
random quotients are non-trivial for some d > 0. It may be that random 
quotients at = of some groups are quasi-isometric to Z.) □ 

This ends the proof of Theorem El 
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A Appendix: The local-global principle, or Cartan- 
Hadamard-Gromov theorem 



The Cartan-Hadamard-Gromov theorem allows to go from a local isoperi- 
metric inequality (concerning small figures in a given space) to isoperimetry 
at large scale. It lies at the heart of our argument: to ensure hyperbolicity of 
a group, it is enough to check the isoperimetric inequality for a finite number 
of diagrams. This finite number depends, of course, of the quality of the 
isoperimetric inequality we get on these small diagrams. In particular, there 
is an algorithm to detect hyperbolicity of a given group. We will use the form 
given by Papasoglu (see |Pap| ), who has written a completely combinatorial 
proof. See also the presentation by Bowditch in |Bowj . 
Let us state the form of the theorem we will use. 

Let X be a simplicial complex of dimension 2 (all faces are triangles). A 
circle drawn m X is a sequence of consecutive edges such that the endpoint 
of the last edge is the starting point of the first one. A disk drawn m X is a 
simplicial map from a triangulated disk to X. 

The area of a disk drawn in X is its number of triangles. The length 
Ltj. of a circle drawn in X is its number of edges. (Both with multiplicity.) 
This is, X is considered being made of equilateral triangles of side 1 and area 
1. 

The area of a drawn circle will be the smallest area of a drawn disk with 
this circle as boundary, or oo if no such disk exists. The length of a drawn 
disk will be the length of its boundary. 

Theorem 51 (P. Papasoglu, cf. (Pap), after M. Gromov) 

Let X be a simplicial complex of dimension 2, simply connected. Suppose 
that for some integer K > 0, any circle S drawn in X whose area lies between 
and 2A0K^ satisfies 

Then any circle S drawn in X with A{S) ^ satisfies 

LUS) ^ MS)/K 



This theorem is a particular case of a more general theorem stated by 
Gromov in jGrolj . section 6.8.F, for a length space. Think of a manifold. At 
very small scales, every curve in it satisfies the same quadratic isoperimetric 
inequality as in the Euclidean space, with constant in. The theorem means 
that if, at a slightly larger scale, the constant in this quadratic isoperimetric 
inequality becomes better (2-10^ instead of 47r), then isoperimetry propagates 
to large scales, and at these large scales the isoperimetric inequality even be- 
comes linear. This is analogous to the fact that a control on the curvature 
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of a manifold (which is a local invariant) allows to deduce global hyperbol- 
icity properties. This was termed by Gromov hyperbolic Cartan-Hadamard 
theorem or local-global principle for hyperbolic spaces. 

The proof of Papasoglu is based on considering the smallest diagram 
violating the inequality to prove, and, by some surgery involving only cutting 
it in various ways, to exhibit a smaller diagram violating the assumptions. 
As this process only requires to consider subdiagrams of a given diagram, he 
proves a somewhat stronger theorem. 

Theorem 52 (P. Papasoglu, cf. [Pap| . after M. Gromov) 

Let X be a simplicial complex of dimension 2, simply connected. Let P be a 
property of disks in X such that any subdisk of a disk having P also has P. 

Suppose that for some integer K > 0, any disk D drawn in X having P, 
whose area lies between and 2407^^ satisfies 

Then any disk D drawn in X, having P, with A{D) ^ K^, satisfies 

Ltr{D) ^ A,,{D)IK 

In the previous version, property P was "having the minimal area for a 
given boundary", hence the change from circles to disks. 

We need to extend these theorems to complexes in which not all the faces 
are triangular. 

Let X be a complex of dimension 2. Let / be a face of X. 

The combinatorial length L^ of / is defined as the number of edges of its 
boundary. The combinatorial area Ac of / is defined as Lc{fY . 

Let D be a disk drawn in X. The combinatorial length Lc of D is the 
length of its boundary. The combinatorial area A^ of D is the sum of the 
combinatorial areas of its faces. 

Proposition 53 - Let X be a complex of dimension 2, simply connected. 
Suppose that a face of X has at most i edges. Let P be a property of disks 
in X such that any subdisk of a disk having P also has P. 

Suppose that for some integer K ^ 10^°£, any disk D drawn in X having 
P, whose area lies between K'^/A and ASOK'^ satisfies 

Lc{DY^ 2-10'' Ac{D) 

Then any disk D drawn in X, having P, with A{D) ^ K^, satisfies 

LciD) ^ AciD)/10^K 

Proof of the proposition - Of course, we will show this proposition 
by triangulating X and applying Papasoglu's theorem. 
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The naive triangulation (cut a n-gon into n — 2 triangles) does not work 
since all triangles do not have the same size. 

Triangulate all faces of X in the following way: consider a face of X with 
n sides as a regular n-gon of perimeter n in the Euclidean plane. Consider a 
triangulation of the plane by equilateral triangles of side 1. (The polygon is 
drawn here with large n, so that it looks like a circle.) 



This is not exactly a triangulation, but with a little work near the bound- 
ary, we can ensure that the polygon is triangulated in such a way that all 
triangles have sides between, say, 1/10 and 10 and area between 1/10 and 10, 
so that the distortion between the triangle metric and the Euclidian metric 
is a factor at most 10. Note that the number of triangles lies between n^/100 
and lOOn^, as the (Euclidian) area of our n-gon is roughly /A-k. 

Let Y be the simplicial complex resulting from X by this triangulation. 

Let Ltr and Atr be the length and area in Y assigning length 1 to each 
edge and area 1 to each triangle. Let Lc and be the length and area in X 
defined above; in Y they can be used for disks coming from X. 

Let L and A be the Euclidean length and area in F, that is, each face of 
X with n edges is a regular n-gon, and the triangles are given their length 
and area coming from the triangulation above in the Euclidean plane. 

The discrepancy between Ltr, L and Lc, and between Ar, A and Ac, is 
at most a factor 100. 

We proceed as follows: We will show that a disk in Y with property 
P, whose area Atr{B) lies between K"^ /2 and 240K'^, satisfies Ltr(-B)^ ^ 
2 ■ lO^Ar(-B)- Then, by the above theorem, any disk B of area Ar(-B) ^ 
will satisfy Lt,{B) ^ Atr{B)/K, thus Lc(fi) ^ AciB)/10^K and we will be 
done. 

Let i? be a disk in Y with property P, whose area Ar(-B) lies between 
K'^/2 and 2A0K^. We want to show that it satisfies KiBf ^ 2 ■ lOMtr(P). 

There are two kinds of disks drawn in Y: those who come from a disk 
drawn in X, and those which there exists faces of X that are only partially 
contained in. 

For the first kind we are done: by assumption, we have Lc(P)^ ^ 2 ■ 
lOiMc(P), which implies Ltr(P)^ ^ 2 ■ lOMtr(P). 

So we want to reduce the problem to this kind of disks. 
We will need the following isoperimetric lemmas: 

Lemma 54 - Let C be a regular closed curve in a Euclidean disk D. Sup- 
pose that C encloses a surface of area at most half the area of D. Then the 
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length of the intersection of C with the boundary of D is at most 32 times 
the length of the intersection of C with the interior of D. 

(One would expect an optimal constant 7r/2 with optimal C enclosing a 
half disk.) 

This lemma is shown in |Gro3] . 6.23. The next lemma is a formal conse- 
quence thereof. 

Lemma 55 - Let C be a regular closed curve in a Euclidean disk D. Sup- 
pose that C encloses a surface of area at least half the area of D. Then the 
length of the intersection of C with the interior of D is at least 1/32 times 
the length ofdD\C. 

The next lemma is a consequence of the first one and of the usual isoperi- 
metric inequality in the Euclidean plane. 

Lemma 56 - Let C be a regular closed curve in a Euclidean disk D. Sup- 
pose that C encloses a surface of area at most half the area of D. Then the 
square of the length of the intersection of C and the interior of D is at least 
1/100 times the area enclosed by C. 

Now back to our disk BiiiY. 

Let be a face of X such that B intersects D. 

Suppose that dB fl -D is connected (that is, B intersects D only once; 
otherwise, make the following construction for each of the connected compo- 
nents). Compare the Euclidean area of B n D with that of D. If it is more 
than one half, enlarge B such that it includes all of D. 

Follow this process for each face D oi X partially intersecting B. 

Let B' be the disk in Y obtained after this process. By construction, we 
have A{B) < A{B') ^ 2A{B). By Lemma we have L{B') ^ 32L(5). 

Now, for each face D of X intersecting B' , either D (Z B' or the area of 
D n B' is at most one half the area of D. 

As a first case, suppose that the cumulated area of all such D which are 
included in B' is at least one half of the area of B'. Define B" by amputing B' 
from all faces D of X which are not totally included in B'. By assumption, 
we have A{B') ^ A{B") ^ A{B')/2. And it follows from Lemma El that 
L{B") ^ 32L{B'). 

By construction, the disk B" is now a disk made of whole faces of X. 
As A{B)/2 ^ A{B") ^ 2A{B), we have K^A ^ A{B") ^ 480^^. We can 
thus apply the isoperimetric assumption: L{B"Y ^ 2 ■ 1Q^^A{B"). Since 
L{B") ^ 322L(5) and A{B) ^ 2A{B"), we get that L{Bf ^ 2 ■ W^A{B), 
hence L^,{B) ^ 2 ■ 10Mtr(5). 

As a second case, imagine that the cumulated area of all such D which 
are wholly included in B' is less than half the area of B' . Let Di be the faces 
of X intersecting B' but not wholly contained in B' . Let = A{Di fl B'). 
We have ^ A{B')/2 ^ K'^/A. 

Let rrti = L[dB' Ci Di). By Lemma EEl we have ^ Oj/lOO. 
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Since any face of X has at most I edges, we have Ac{Di fl B') ^ i"^, so 
for any i, Oj ^ 100£^. Group the indices i in packs / so that for each /, we 
have 100^2 ^ Eie/ «i ^ 200£2. There are at least K^/800f packs /. Let 
Mi = ^i^irrii. 

We have 



and 

and as there are at least K^/SOOi"^ packs 

L(5')' ^ i^VlO^f' ^ A{B')Kyi0^f 
as ^ 480/^2. Now as L{B') ^ 32L{B) and A{B') ^ A{B) we have 

or 

and we are done as ^ 10^°£^. 

This ends the proof of the proposition. □ 
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B Appendix: Conjugacy and isoperimetry in 
hyperbolic groups 



We prove here some of the statements needed in the text about conjugacy 
of words and narrowness of diagrams in hyperbolic groups. For general ref- 
erences on hyperbolic groups and spaces we refer to |BHj . |CDP| or |GHj . 

Throughout this appendix, G will denote a hyperbolic discrete group 
generated by a finite symmetric set S*, defined by a finite set of relations R 
(every discrete hyperbolic group is finitely presented, cf. (S)). Let 5 be a 
hyperbolicity constant w.r.t. S. 

A word will be a word made of letters in S. The length of a word w will 
be its number of letters (regardless of whether it is equal to a shorter word 
in the group), denoted by \w\. 

Equality of words will always be with respect to the group G. 

Let C be an isoperimetric constant for G, i.e. a positive number such 
that any simply connected minimal van Kampen diagram D on G satisfies 
\dD\ ^ C\D\. See section [T] for definitions and references about diagrams 
and isoperimetry. 

Let us also suppose that the relations in the presentation R of G have 
length at most A. 

B.l Conjugate words in G 

The goal of this section is to show that if a word x is known to be a conjugate 
in G of a short word y, then some cyclic permutation of x is conjugate to y 
by a short word. If x = uyu"^, we will say that x is conjugate to y by u, or 
that u conjugates x and y, or that m is a conjugating word. We recall the 

Definition - A word w is said to be cyclically geodesic if it and all of its 
cyclic permutations label geodesic words in G. 

The following is well-known (cf. |BH| . p. 452, where the authors use "fully 
reduced" for "cyclically geodesic"). 

Proposition 57 - Let u, v be cyclically geodesic words representing con- 
jugate elements of G. Then 

• either \u\ ^85+1 and \v\ ^86 + 1 

• or else there exist cyclic permutations u' and v' of u and v which are 
conjugate by a word of length at most 25 + 1. 

This immediately extends to: 

Proposition 58 - Let u, v be cyclically geodesic words representing con- 
jugate elements of G. Then 

• either \u\ ^85 + 1 and \v\ ^85 + 1 
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or else there exist a cyclic permutation v' of v which is conjugate to u 
by a word of length at most 46 + 1. 



Proof - Write u = uu' and v = v v such that the cyclic conjugates 
u"u' and v"v' are conjugate by a word 6i of length at most 25 + 1 as in 
Proposition inn Construct the quadrilateral u"u'6iv'~^v"~^6^^ . As u and v 
are cyclically geodesic, the sides u"u' and v"v' are geodesic, and in this S- 
hyperbolic quadrilateral any point on one side is 2(5-close to some other side. 
In particular, any point on the side u"u' is {26 + |(5i|)-close to the side v"v' . 



5i 



V 

v' 



u" 


A ^ 


h-- 


u" 


1 









A' 
B' 



Let A be the endpoint of u" . The point A is (25 + |5i|)-close to some 
point B on v"v' . Let §2 be a path connecting A to B. The point B divides 
v"v' into two words v'" and v"" , and we have u = u'u" = d2v""v"'62^ which 
ends the proof of the proposition. □ 

We will need the following 

Proposition 59 - Let w be a geodesic word. There exists a cyclically 
geodesic word z which is conjugate to w by a word of length at most {\w\ — 
\z\){S + 1/2) + 4:6. 

Proof - Set wq = w and construct a sequence Wn of geodesic words by 
induction. If Wn is cyclically geodesic, stop. If not, then write Wn = w'^^w" 
such that w"w'^ is not geodesic. Then set Wn+i to a geodesic word equal 
to w'^w'^. As length decreases at least by 1 at each step, the process stops 
after a finite number n of steps and Wn is cyclically geodesic. Note that 
n ^ \ w\ — \ Wn\- 

In the Cayley graph of the group, define Wi to be the quasi-geodesic 
{w'qw'i . . . w'i^iWi) kez with as above: 




Consider any of the geodesic triangles made by Wi, w'l_^, ^i-i- As these 
are (5-hyperbolic, this means that any point of Wi is (5-close to the line 
Thus, any point of Wn is n(5-close to Wq. 

Consider the two endpoints of a copy of Wn lying on Wn- These two points 
are n(5-close to Wq^ and since the whole picture is invariant by translation. 
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this means that we can find a word u of length at most n5 such that u con- 
jugates Wn to some cyclic conjugate w"w' of w. Now construct the hexagon 
w"w'uw~^u~^ . 



A 







w 




c 





u 



v' B 



Let A be the endpoint of w" . By elementary 5- hyperbolic geometry (ap- 
proximation by a tripod of the triangle consisting of A and the endpoints of 
u), the distance of A to the side v is at most + + 2 |m| — |m„|)/2 + 45. 
Let -B be a point on side m„ realizing this minimal distance. Let Wn = v'v" 
such that the endpoint of v' is B. Let c be the word defined by AB. Then 
we have w'w" = cv"v'c~^, so w is conjugate to a cyclic conjugate of Wn by c. 
Taking z = v"v' ends the proof of the proposition. □ 

Now, in the spirit of Proposition EH let Cc = max^. ^ min{|M| ,x = uyu~^} 
where the range of the maximum is the set of all couples of conjugate words 
of length at most 85 + 1. As this set is finite we have Cc < oo. Let C'^ = 
Cc + 4:6"^ + 125 + 2. 

Proposition 60 - Let x be a geodesic word and y a conjugate of x of 
minimal length. Then some cyclic conjugates of x and y are conjugate by a 
word of length at most C'^. 

Proof - Let m be a conjugating word of minimal length: x = uyu~^. This 
defines a van Kampen diagram ABCD whose sides are labeled by n, y, 
and x~^ in this order. 

As X, y and u are geodesic words (by minimality assumption), the 1- 
skeleton of this diagram embeds in the Cayley graph of the group, and we 
get a hyperbolic quadrilateral ABCD in which every point on any side is 
25-close to a point on another side. 

As a first case, suppose that every point on the side AB is 25-close to 
either AD or BC . 




B 



Let A' be the first point on AB which is 2(5-close to BC . Considering the 
point just before A\ we know that A' is {25 + l)-close to AD. 
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Then we can write x = x'x", u = u'u" and y = y'y" such that there exist 
words 6i and 62 of length at most 25+1 such that u' = x'6i and u" = S2y'~^ . 
Then, we have x"x' = x'~^xx' = 6iu'~^uyu~^u'6i~^ = Siu"yu"~^6^^ = 
^i^2y"y'^2^5i^ , and the cyclic conjugate x"x' of x is conjugate to y"y' by 
a word of length at most 45 + 2. 

By symmetry the same tricks work if DC is close to DA or to CB. 

Second, if this is not the case, let and Dn be the points on AB and 
DC at distance n away from A and -D, respectively. Let n be smallest such 
that either A^ or D^ is not 25-close to AD nor to BC . By symmetry, let us 
suppose it is An rather than Dn. Let be a geodesic word joining An to Dn- 




Let u' be the prefix of u joining A to An. By definition of n the point 
An is 25 + 1-close to AD. We have u' = x'6i where x' is a prefix of x, and 
\6i\ ^ 25+1. Thus x"x' is conjugate to w by a word of length at most 25 + 1. 

Now let us work in AnBCDn. By definition of An, we know there exists 
a point A' on CD„ such that AnA' ^ 25. Now we have AnDn ^ 2S + A'Dn = 
25 + DnC - A'C = 26 + AnB - A'C ^ 45 + A'B - A'C ^ 45 + BC. Thus 
\w\ ^ + \ y\. 

By our minimality assumption, y is cyclically geodesic. If w is cyclically 
geodesic as well, then we conclude by Proposition EHl If not, use Proposi- 
tion EHl to find a cyclically geodesic word z which is conjugate to by a word 
of length at most {\w\ — \z\)[5 + 1/2) + 45. By our minimality assumption 
on y, we have that \z\ ^ hence lu^l — 1^1 ^ — |?/| ^ 45. Now z and y 
are both cyclically geodesic and we conclude by Proposition EHl D 

Corollary 61 - Let x be any word and y be a conjugate of x of minimal 
length. Then some cyclic conjugates of x and y are conjugate by a word of 
length at most 51og2 |x| + C'^+ 1. 

Proof - This is because in a hyperbolic space, a geodesic joining the ends 
of any curve of length £ stays at distance at most 1+5 log2 i from this curve 
(cf. |BH| . p. 400). Take a geodesic word x' equal to x and apply the above 
proposition; then any cyclic permutation of x' will be conjugate to a cyclic 
permutation of a; by a word of length at most 1 + 51og2 |x|. □ 
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B.2 Cyclic subgroups 

We will also need the following. 

Proposition 62 - There exists a constant R such that, for all hyperbolic 
u E G, the Hausdorff distance between the set {u"')nez and any geodesic with 
the same limit points is at most \\u\\ + R. 

Proof 

Lemma 63 - The Hausdorff distance between {u^)n&z SLnd any geodesic 
with the same limit points is finite. 

Proof of the lemma - From |GH| (p. 150) we know that k {u^)kez is 
a quasi-geodesic. From |GHj (p. 101) we thus know that this quasi-geodesic 
lies at finite Hausdorff distance from some geodesic. From |GH] (p. 119) we 
know that any two geodesies with the same limit points lie at finite Hausdorff 
distance. □ 

Now for the proposition. First, suppose that u is cyclically geodesic. Let 
p be a geodesic path joining e to u. Let A be the union of the paths 
n G Z. Since u is cyclically geodesic, A is a (1,0, ||M||)-local quasi-geodesic 
(notation as in |GH| ). Thus, there exist constants R and L depending only 
on G such that, if ^ L, then A lies at Hausdorff distance at most R 
of some geodesic A' equivalent to it (see |GHj . p. 101), hence at Hausdorff 
distance 165 + i? of any other equivalent geodesic ( |GH| . p. 119). As there 
are only a finite number of u's such that ||n|| < L, and as for each of them 
the lemma states that A lies at finite Hausdorff distance from any equivalent 
geodesic, we are done when u is cyclically geodesic. 

If u is not cyclically geodesic, apply Proposition [001 to get a cyclically 
geodesic word v such that v = xu"u'x~^ with u = u'u" and |x| ^ G'^. Apply 
the above to {v^)k&'- this set stays at distance at most R of some geodesic 

A. Translate by u'x~^. The set {u'x~^v'')k£Z stays at distance at most R of 
the geodesic u'x~^A. But since = u'x~^v^xu'~^ , the Hausdorff distance 
between the sets {u^)k& and {u'x~^v^)k(^i is at most ^ + ||n|| 
and we are done. □ 

Since the stabilizer of any point of the boundary is either finite or has Z as 
a finite index subgroup (cf |GH| . p. 154), we get as an immediate by-product 
of the lemma 

Corollary 64 - Let A be a geodesic in G, with limit points a and b. 
There exists a constant R{A) such that for any x in the stabilizer of a and 
b, the distance from x to A is at most R{A). 

B. 3 One- hole diagrams 

We now turn to the study of isoperimetry of van Kampen diagrams with 
exactly one hole. Recall that conjugacy of two words u and v is equivalent 
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to the existence of a one-hole van Kampen diagram bordered by u and v. 

Proposition 65 - There exists a constant C > such that for any two 
conjugate words u and v, there exists a one-hole diagram D bordered hy u 
and V, such that C \D\ ^ \u\ + \v\. 

Proof - Let us first suppose that u and v are geodesic words. Let w be 
the shortest common conjugate of m and v. By Proposition [001 u and w are 
conjugate by a word x of length at most |m| /2 + \w\ /2 + C'^. Thus, there 
exists a minimal van Kampen diagram D bordered by wx~^u~^x. It follows 
from the isoperimetry in G that \D\ ^ + \w\ + 2 \x\)/C. As \w\ ^ \u\ we 
have \D\ ^ \u\ (4 + 2C^)/C. 

Do the same job with v and w, to get a diagram D' bordered by v~^y~^wy. 
Then paste these two diagrams along the w's, getting a diagram bordered 
by v{xy)~^u~^{xy). Then transform this diagram into an annulus by gluing 
the two xy sides; this leads to a one-hole diagram bordered by u and v. The 
number of its faces is at most (|n| + |f|)(4 + 2C'J/C and we conclude by 
setting C = C / {4 + 2C'J in case u and v are geodesic. 




In case u is not geodesic, let u' be a geodesic word equal to u in G. 
We know there exists a van Kampen diagram bordered by uu'~^ , with 
\Du\ ^ 2\u\ /G. Let be a similar diagram for v. Let D be as above a 
one-hole minimal diagram bordered by u' and v' , with \D\ ^ {\u\ + \ v\)/G' 
with C" as above. Then we can glue Du and to D along their common 
boundaries. 




This leads to a diagram with at most + |?;|)/C" + 2(|m| + \v\)/G faces, 
and we conclude by re-setting G' to 1/(1/C" + 2/C). □ 
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B.4 Narrowness of diagrams 

We now prove that diagrams (with or without holes) in a hyperbolic space 
are narrow (see section [T] for definitions). 

Let a = 1/ log(l/(l — C'/A)) where C is given by Proposition IHSl (Recall 
A is the maximal length of relators in the presentation of G.) Let \x] denote 
the integer part of x plus one (such that [log|i5|] = 1 for |D| = 1). 

Proposition 66 - Let D be a minimal diagram with either or 1 hole. 
Then D is \a\og\D\]-narrow. 

Proof - Let -D be a minimal van Kampen diagram with or 1 hole. Propo- 
sition IHSl tells us that C \D\ ^ \dD\. Let n be the number of faces of D lying 
on the boundary. We have \dD\ ^ \n. Thus the proportion of faces of D 
lying on the boundary is at least C'/A. 

Let D' be the diagram D with the boundary faces removed. (In case D' 
is not connected, consider any one of its connected components.) D' has 
at most one hole. D' is minimal as a subdiagram of a minimal diagram. 
Furthermore, we have \D'\ ^ |Z}| (1 — C'/A). By the same argument, the 
proportion of boundary faces of D' is at least C'/A, and after removing 
these faces we get a third diagram D" with at most \D\ (1 — C'/A)^ faces. 
Repeating the argument yields the desired conclusion as D is exhausted after 
log |D|/log(l/(l- C'/A)) steps. □ 

Proposition 67 - Let D be a minimal n-hole diagram. Then D satisfies 
the isoperimetric inequality 

\dD\^C\D\ -nX{2 + A\a\og\D\]) 

Proof 

Lemma 68 - Let D be a minimal n-hole van Kampen diagram (n ^ 1). 
Either there exists a path in the 1-skeleton of D joining two holes, with 
length at most A(l + 2 [a log |-D|]), or there exists a path in the 1-skeleton of 
D joining one hole with the exterior boundary, with length at most A(l/2 + 
\a\og\DW). 

Proof of the lemma - We work by induction on n. Set e = [alog|D|]. 

Observe that a chain of adjacent faces provides a path of length at 
most NX/ 2 in the 1-skeleton between any two vertices of these faces. 

For n = 1, the lemma is clear: by the last proposition, the diagram is 
e-narrow, thus the two components of the boundary are linked by a chain of 
at most 2e faces, providing a path of length at most Ae. 

Now suppose the lemma is true up to some n ^ 1, and let D he a {n + 1)- 
hole van Kampen diagram. For every hole i, let Bi be the set of faces of D 
lying at distance at most 2e + 1 from the boundary of i. 

Either, first, there are holes i j such that Bi and Bj have a common 
face or edge or vertex. This provides a chain of at most 4e + 2 faces between 
the boundaries of holes i and j, thus a path of length at most A(2e + 1). 
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Or, second, the BiS do not meet. Choose any hole i. 

There can be holes in different from i, that can be filled in D. Define 
Bl as Bi plus the interiors of these holes in in such a manner that all holes 
of B'- are holes of D. 

First, suppose that B'^ does not encircle any hole j of D other than i. 
As Bi is defined as the ball of radius 2e + 1 around i in D, any face on the 
exterior boundary of B'^ is either a face at distance 2e + 1 from hole i, or a 
face on the boundary of D. But as B'- is a one-hole van Kampen diagram 
included in D, hence e-narrow by Proposition EEl not all faces of the exterior 
boundary of Sj' can be at distance 2e + 1 from i. That is, at least one face of 
the exterior boundary of B'^ is on the exterior boundary of D, hence a path 
of length at most A(e + 1/2). 

Second, imagine that B^ encircles at least one hole j ^ i of D. Consider 
the part D' of D comprised between B[ and j, that is, the connected compo- 
nent oi D\ B[ containing j. This is a diagram with at least one hole j (and 
maybe others), but as it does not contain i it has at most n holes. As D is 
minimal, D' is. By the induction assumption, either two holes in D' are at 
distance at most A(2e +1), in which case we are done, or one hole, say j, in 
D' is at distance at most A(e -|- 1/2) of the exterior boundary of D' . But the 
exterior boundary of D' is part of the boundary of B[, any point of which is 
at distance at most A(e + 1/2) of hole i. Thus i and j are linked by a path 
of length at most A(2e + 1), which ends the proof of the lemma. □ 

Corollary of Lemma 1681 A minimal n-hole diagram can be made 
simply connected by cutting it along n curves of cumulated length at most 
n\{2\alog\D\'] + 1). 

The corollary of the lemma ends the proof of the proposition. □ 

Corollary 69 - A minimal n-hole diagram D is \a log | D |] +n(4 [a log | 1] + 
2) -narrow. 

Proof - Let D' be a simply connected van Kampen diagram resulting from 
cutting D along curves of cumulated length at most nA(2[Q;log |Z^|] + 1) 
(which run along at most n(4[a log + 2) faces as can immediately be 
seen on the proof above). Every face in the new diagram is at distance 
[a log from the boundary of D' by Proposition l66l The boundary of D 
is a subset of the boundary of that of D' , but by construction any face on the 
boundary oi D' is at distance at most n(4[alog +2) from the boundary 
oiD. □ 

B.5 Coarsenings of diagrams 

If -D is a very narrow diagram with holes, then we have an intuitive feeling of 
which parts of its boundary "face" which. This intuition can be made clear 
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using the approximation of hyperbolic spaces by trees. We now give for this 
intuition a mathematical setting fitted to our needs. 

Definition 70- Let wi,...,Wn be n geodesic words in G. A {k,e)- 
matching of these words is a set of words w[, . . . , w'f^, some of which may be 
empty, together with a partition Ii, I2 of {1, ... , k} and a bijection ip between 
Ii and I2, such that: 

• The words w[, . . . ,w'f, form a partition of the words Wi, . . . ,Wn- 

• For all i G /i, there exist words 61 and 62 of length at most e such that 
w[ = Siw'^i^--j62 in G (we will say that w[ and w'^f^^-^ e-matchj. 

This means that we cut the words Wi into at most k subwords such that 
each subword "faces" another one up to e. Typically e is of order 5. We have 
to leave open the possibility that some w[ are empty since, for example, if 
one of the Wi's is very short, it could have to be matched with the empty 
word. 

The following proposition is basically equivalent to the approximation of 
finite hyperbolic spaces by trees. 

Proposition 71 - Let wi, . . . , Wn, for n ^ 2, be n geodesic words in G 
such that Wi . . . Wn = e. There exists a (4n, inS) -matching of these words. 

For n = 3 this closely resembles the definition of thin triangles. 
Proof - Work by induction on n. The result is clear for n = 2. Suppose 
that n + 1 words Wi, . . . ,Wn, Wn+i forming a closed piecewise geodesic path 
in G are given. Let x be a geodesic word equal to WnWn+i- The three 
geodesic words x, Wn, Wn+i form a 5-thin triangle. Let x = X1X2 where the 
endpoint of Xi lies at distance at most 26 from both sides Wn and Wn+i of the 
triangle. Now apply the induction assumption to Wi, . . . , Wn-i, x. This gives 
a matching involving a partition of the word x into subwords x[, i E I. At 
most one of the words x[ straddles the endpoint of Xi. If some x[ is included 
in Xi or X2 and is 4n(5-matched to w' where w' is a subword of the Wj's, 
then using thinness of the triangle x, w„, Wn+i we can (4n + 2)5-match w' 
with a subword of Wn or Wn+i- If x- straddles the endpoint of xi, and x[ is 
4n(5-matched to ty', then we can write x- = x^'x^" where the endpoint of x-' is 
that of xi, and also write w' = w"w"' such that w" (4?7, + 2)5-matches with x'l 
and w'" matches with x'l'; using thinness of the triangle x, Wm w^n+i we can 
(4n + 4)(5-match w" and w'" with subwords of Wn and Wn+i respectively. Last, 
the two parts of Wn and which are not 25-close to x can be matched 

together. □ 

Doing the induction more cleverly, one can even obtain a (4n, 4(51og2 n)- 
matching. 

We are to apply this construction to diagrams with n holes. In order to 
symmetrize the role of holes and of the boundary in the following proposition, 
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we view a n-hole diagram as a (n + l)-hole diagram embedded in the sphere. 

Proposition 72 - There exists a constant B (depending on G) such that, 
for any minimal {n + l)-hole diagram D embedded in the sphere, there exists 
a {8n, Bn\og\D\)-matching of the boundary words of D. This matching is 
called the coarsening of D. 

The coarsening of D is best visualized as a planar graph as in the following 
picture (black dots mark the points where we partition the boundary words). 
The planar graph can be precisely defined but we do not need it. 




Proof - First, using CorollaryinU and the fact already used above that any 
geodesic joining the endpoints of a curve of length £ stays (51og£ + l)-close 
to that curve, we can suppose that the boundary words of D are cyclically 
geodesic: the length I of any boundary word of D is at most \ \D\ and so a 
{k, £:)-matching for the cyclically reduced words will give a {k,e + 2 + C'^ + 
251og(A |Z^|))-matching of the original words. 

Use the corollary of LemmalMlto cut D into a simply connected diagram 
D' . The boundary word of D' is made of n' ^ 2n pieces Wi, . . . ,Wn' parti- 
tioning the boundary words of D, with little words Xi, . . . ,Xn' of cumulated 
length at most Bin\og\D\ in between (for some constant Bi depending on 
G). Define yi to be a geodesic word equal to WiXi. 

Now apply the previous proposition to get a (4n', 4?T,'(5)-matching of the 
words yi, 1/2, yn'- Since the Xj's are of cumulated length at most 
Bin\og\D\, this matching defines a (4n',4?7,'(5 + 25 + i?i?7,log |_D|)-matching 
of the Wi's. □ 
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C Appendix: Cases of harmful torsion 



Here we show that the assumption of harmless torsion cannot be removed. 
Examples of hyperbolic groups with harmful torsion include such groups as 
(Fm X Z/2Z)-k Fm with m ^ 2, since the Z/2Z factor has a centralizer which 
is a free group of rank m. More precisely: 

Proposition 73 - Theoreni\^does not hold for the hyperbohc group (F4 x 
Z/2Z)^F4. 

Proof - Consider the two groups Gi = {Fm x Z/2Z) Fm and G2 = 
{Fm ^ Fm) X Z/2Z. In each of these, denote by u a generator for Z/2Z 
and respectively by ai, . . . , and bi, . . . ,bm a standard set of generators for 
the first and second factor Fm- Let Ai and A2 be the subgroups of Gi and 
G2, respectively, generated by the aj's, and define Bi and B2 similarly. 

It is immediate to see that these groups are hyperbolic. 

For any group G generated by k elements, let X{G) denote the spectral 
radius of the random walk on G (w.r.t. the k generators), and let 6^ = 1 + 
\0g2k A be the gross cogrowth of G. 

The spectral radius for the free group F^ is A(Ffc) = \^2k — 1/k. By 
Lemma 4.1 of |Klj . the spectral radius for the group F^ x Z/2Z is equal to 
(1 + kX{Fk))/{k + 1) = (1 + ^/2k^)/{k + 1). 

In particular, the spectral radius of G2 is (1 + y/Am — l)/(2m + 1). 

Take m = 4. We have 6{G2) = 1 + log4^+2 ~ -788. In particular, 

the critical density d^^.^^ for random quotients of G2 by plain random words 
is about 1 - .788 = .212. 

Since G2 is a quotient of Gi we have of course \{Gi) ^ \{G2)- But 
Theorem 1 of |K1| states that quotienting a group by (the normal closure of) 
a non-amenable subgroup strictly increases the spectral radius. The kernel 
of the quotient map Gi — > G2 contains the two elements ubiu~^bi~^ and 
ub2U~^b2~^ which generate a free non-cyclic subgroup in Z/2Z^i?i. Hence 
the kernel is non-amenable. 

Thus, we have X{Gi) < A(G2), so that if Theorem HI holds for Gi, the 
critical density rfj^j^ for random quotients of Gi satisfies 

•^crit ^ ^crit ~ -212 

But we are going to prove that random quotients of Gi are very probably 



trivial as soon as d > (R, 



crit • 



Let -R be a set of randomly chosen words in u, oi, . . . , a^, foi, . . . , 6m, of 
length at density d. (Note that for the model of random quotients by plain 
random words, the law of the relators depends only on the generators and 
not on the initial group.) We now study the random quotient Gi/ (R) and 
consider the elements of R as elements of Gi . 

Let us compute the probability that one of the relators in R belongs 
to C = Ai X Z/2Z C Gi. The number of words of length i belonging 
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to C is at least (2m + 2)^, so that the corresponding density is at least 
lo §4771+2(2'^ + 2) ^ .797. So there exists a density (ic* ^ 1 — .797 — .203 such 
that \i d > dc, there will very probably be some element of R lying in C . 
Note that this is below the critical density d\^-^^ for random quotients of Gi 
predicted by Theorem |31 (if it holds). Also, dc is not since Gi/{C) is not 
amenable. 

By the same argument, for d > dc it is very probable that R contains 
a relator r of the form r = xc where x is one of the generators of Gi and 
c is a word of length i — 1 with c G C. As the random words are sampled 
uniformly, when £ is big enough this will occur for each of the relators x of 
Gi. 

Let H be the random quotient G/{R). By definition of C, u commutes 
with c in Gi, so in H we have 

uxu^^x~^ = uxcu~^c~^x~^ = uru~^r~^ = e 

since r = e in if by definition. 

Thus, in H, the generator u commutes with all the generators of Gi. Let 
S G Gi he the set of the commutators of u with these generators, we have 

H = G,/{R) = G,/{R US) = {G^/{S)) /{R) = G^/iR) 

But G2/ {R) is a random quotient of G2 (this is because for random quo- 
tients by plain words, the law of R is independent on the initial group). In 
particular, if d > d^^.^^ ~ .212 this group will very probably be trivial, whereas 
if Theorem m were valid for Gi, the critical value would be dl^.^^ > d^^.^^. 

This ends the proof. □ 



So random quotients of Gi behave in a different manner than that of 
Theorem HI For densities between and dc < -203 they behave "normally" 
(in particular. Axiom 4 is satisfied). But for densities between dc and .212, 
Axiom 4 is not satisfied, and the random quotients behave like random quo- 
tients of G2, and they vanish as soon as d > .212, whereas the expected 
critical density in Theorem HI would be higher. (The gap between .203 and 
.212 can be made larger by taking bigger m.) 



Random quotients of Gi are 

- 

Infinite Infinite Trivial 

ubi 7^ biu ubi = biu 



The two phases are really different: indeed, a difference can be seen in 
the ball of radius 2 in the Cayley graph since, in the random quotient, the 
relation ubi = biu (notation as above) holds in the second phase but not 
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in the first one (since in the first phase, the "ordinary" theory of random 
quotients holds and in particular, the radius of injectivity grows with £). 

More than three phases can probably be arranged, using groups such as 

(((F^ X Z/2Z) Fp) X Z/2Z) ^ Fq 

with different critical densities equal to the densities of the centralizers of the 
different torsion elements. 
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